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© Vector. 

© The invention relates to vectors which include an inducible selection gene and a sequence which 
codes for a heterologous polypeptide. In a particular example the selection gene comprises the tetA and 
tetR genes. The vector may also include a sequence which is capable of conferring stability on the 
vector, such as the cer sequence, a transcription terminator, such as gene 32 from bacteriophage T4, a 
ribosome binding site, and a multi-cloning site. Hosts transformed with the vector, processes for 
preparing these hots, and processes for preparing polypeptides using these hosts. 
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v ec tL h L P dn S n inVen ! i0n relateS t0 3 VeCt ° r WhiCh inClUd6S 30 induCible se,ection 9 ene - a host containing the 
vector and processes for preparing the vector and host. 

m^LTr? 0 ' b f teria ' clonin 9 and expression vectors contain an antibiotic resistance marker as a simple 
5 of the f^T n S6 r 1 ° r Pl3Smid in the baCt6rial h ° St ° f Choice " This is «"«»•* ampicil.in because 
IIhI 7 T °" 9 ' nal C '° nin9 VeCtore cons tructed, pBR322, carries an ampicillin resistance deter- 

JTSS^SSSJSS i^f l h 95 " 1 13) „° ne of the derivatives of this plasmid is pAT153 < Tw ** and *~ 

,o fid.Swr 6 '? 6 ' 6 *" SyStemS h3Ve b6en US6d in Clonin 9 vectors fortne Preparation of polypep- 
w tides, there is still a need for improved selection systems. 

According to the present invention there is provided a vector which comprises an inducible selection gene 
and a sequence which codes for a heterologous polypeptide. 

"T 0r ° 38 used herein is used in ite braadest sense and includes within its meaning any replicon 
« r! L W f I™ 9 recombinant DNA material from one cell to another. The present inventionlncludes vec- 

for tTnSf T 9 k " 2? 3 ^ a " d VeCt ° re ' SUCh 38 Plasmids ' which are ^ constructing vectors 
for transfer of recombinant DNA to hosts. a 

tion Inn T d H Cib !K," S ?! Cti °!1 9ene " ( ° r S6leCti0n mark6r) Wi "' in 9eneral ' include a 9ene which facilitates selec- 
ted.^ set^n h ,nduClble * e,ection 9 ene ma V comprise a first gene which codes for a substance which 
20 occurt Z^Tr T 1?°°^ WhlBh ex P ression of that substance so that expression only 

whth f r ^ 1 " S - ThUS ' f ° r 6Xample the fi,St 9ene maV COm P rise a 9 ene which codes fora sub^ 
reiser Tn ^ ,T 40 a " 8ntibi0tiC 3nd the S6C ° nd 9ene may com P" se a 9ene which codes for a 

to coZ JlSl *Sf ant,b '° tiC thS SySt6m iS induCed and ex P r ession of the first gene takes place 

Zr t nZ?T Tk !-; thUS PermittinQ recombinant ^tors which carry the selection gene to be selec- 
25 and h^.S? °1 k rePreSSi ° n °° CUrS S ° th3t ex P r ession of the first gene does not take place 

25 and the substance wh.ch confers antibiotic resistance in the vector is not produced 

Th-f Part,CU,ar examp,e of a suitable induc ible selection gene is one which includes the tetA and tetR genes 
li^f ? " 9e " e WhlCh f ° r 3 substance whic h confers resistance to tetracycline; whlisFthe tetR 

fnZ nZ^rr OT r te,n W K hiCh iS ablS l ° PreVent eXpreSSi ° n ° f the «A 9ene. The system is induct 
in the presence of tetracycline, so that in its presence the tetA gene is expressed thus confening tetracycline 
30 resistance on the vectors which carry the selection gene. In the absence of tetracaine the XS 
ressedbymet^genesothatexpressionof^does not take 

ible L h |^rn rt Joo a r h em h b0diment of * e P resent inve "tion there is provided a vector which comprises an induc- 
es pofypeptkle C ° mPnSeS ~ ~ 96neS ' ^ & WhiCh ^ f ° r 3 he terologous 

MrZ^JlT- T' ' n ad ? i0n t0 thS Sequences mentioned above include other DNA sequences suitable for 
SSfnil k P h T' SU ? 38 appr0priate contra « sequences. For example the vector may include a promoter 
r 9 , a " d « anSCT W° n teminator sequence. The vector will, in general, include an origin o 
replication, for example that derived from plasmid pAT1 53 

W For px^'^TITk 16 ° f a SU j tab,e pramoter is the tryptophan (trpj promoter. Other promoters may be used. 
For example ,n a further embodiment of the present invention the vector includes the T7A3 promoter (SEQ ID 
NO 42), ,n which case the vector may optionally also include an operator such as lacO (especially the shortened 
SSLtSTJ °k N ? ^ T7A3 Pr ° m0ter S6qUenCe Sh ° Wn in SEQ ,D N0 42 is shown upt thf 

. tj^szzs^ used with the ,aco ™ - - ~ - « - 

fo und terminator is a derivative of the transcription terminator se£ - uence 

th* trlT enCe W , hiCh *° nferS Stabi ' ity °" the VeCt ° r may alSO be P resent An example of such a sequence is 
the cer sequence (see, for example, Cell, 36, 1 097-1 1 03, 1 984) quence is 

50 P nrI h !.nH Ct0r "7 in ^ de , a rnulticlonin 9 sequence to facilitate the introduction of ribosome binding sequ- 
ences, and genes for heterologous polypeptides etc. 

The heterologous polypeptide may comprise a polypeptide which possesses pharmacological properties 

me orenrj T * « SU ° h * P °' yPeptide ' S riCi " A Which ^ 

the preparation of immunotoxins. A further example is a polypeptide known as G-CSF or an analogue thereof 

a h ? 6 ,S St,mUlatin9 f3Ct0r (G " CSF) h3S been described in the literature by Wallet K et 7proc 

^69 strand PPtI V° p 2 k, PP 1526 - 153 ° aPd haS alS ° b6en d6SCribed in Eur °P ean Pa »ent Rubicon No 
169 566 and PCT Patent Pubhcation No WO 87/01 1 32. G-CSF has been shown to stimulate granulocyte pro- 
duction mvfvo and to function with minimal side effects. As a result human G-CSF is seen as having pofenSl 
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utility in the management of neutropaenia associated with chemotherapy, radiation therapy, radiation accident 
or autologous bone marrow transplantation. Moreover G-CSF may have utility in the stimulation of bone marrow 
suppression associated with AIDS, in the treatment of myelodys plastic syndromes characterised by granulo- 
cyte functional abnormalities and as an adjunct to the treatment of severe infections. 

The term "human G-CSF" as used herein refers to those G-CSFs that have been found to exist in nature 
and includes the two polypeptides having the amino acid sequence set out in SEQ. ID No 41. These two 
polypeptides differ only in so far as a tripeptide insert Val-Ser-Glu is present in one polypeptide between posi- 
tions 35 and 36, but absent in the other. The numbering used throughout the present specification is based on 
the naturally occurring polypeptide without the Val-Ser-Glu insert. 

Analogues of G-CSF include polypeptides which differ from that of naturally occurring G-CSFs in terms of 
the identity or location of one or more amino acid residues. For example, such analogues may contain sub- 
stitutions, or terminal or intermediate additions or deletions of such residues. Such analogues would share the 
property of natural G-CSFs of being able to stimulate granulocyte production. 

In particular the present invention provides a replicable expression vehicle which comprises a vector as 
hereinbefore defined. 

In a particular embodiment of the present invention there is provided a replicable plasmidic expression ve- 
hicle which comprises an inducible selection gene comprising the tetA and tetR genes, and a DNA sequence 
which codes for a heterologous polypeptide. 

As indicated above, the expression vehicle may include a sequence capable of conferring stability on the 
expression vehicle, such as the cer sequence. 

In a further embodiment of the present invention there is provided a vector which comprises a replicable 
plasmidic expression vehicle comprising a promoter, the cer sequence, a transcription terminator as found in 
the terminus of gene 32 bacteriophage T4, an origin of replication and a DNA sequence which codes for a 
heterologous polypeptide. 

According to the present invention there is also provided a process for preparing a polypeptide, said pro- 
cess comprising culturing a host which comprises a vector of the present invention so that the polypeptide is 
expressed. 

The process may be carried out in the absence of the product used in selection. For example, in the case 
where the selection system comprises the tetA and tetR genes, the process may be carried out in the absence 
of tetracycline. 

The above-mentioned process may be effected by the use of any appropriate host cell, such as bacterial, 
yeast, or mammmalian cells. A particular example of a suitable host comprises bacterial cells, for example E. 
coli . 

It will be appreciated that where the desired metabolite is not passed out of the host at a useful rate, the 
host may be cultured and harvested as the intact cell and the desired polypeptide recovered by subsequently 
extracting the cells, for example after separation from the medium containing nutrients necessary for growth 
of the host cell. Where the metabolite is passed out of the host cell into the surrounding culture solution, then 
the polypeptide may be recovered by extraction in the normal way. 

According to the present invention there is also provided a host capable of expressing a heterologous 
polypeptide, which host comprises a vector (such as replicable plasmidic expression vehicle) as herein defined. 

In a particular embodiment of the present invention there is provided a host transformed with a replicable 
plasmidic expression vehicle which comprises an inducible selection gene comprising the tetA and tetR genes, 
and a DNA sequence which codes for a heterologous polypeptide. 

According to a further feature of the present invention there is also provided a process for the preparation 
of a host as hereinbefore defined, said process comprising transforming a host by the insertion therein of a 
vector (such as a replicable plasmidic expression vehicle) as hereinbefore defined. 

Suitable methods for the introduction of foreign genetic material into a host are known from the literature. 
Such methods include formation of a replicable expression vehicle comprising a vector and the foreign genetic 
material, and introduction of the vehicle into the host. Introduction of the vehicle into the host may be facilitated 
by subjecting the host to an appropriate treatment, for example in the case of E. coli , by treatment with calcium 
chloride solution. 

The present invention also provides a process for the preparation of a vector as herein defined comprising 
inserting a sequence which codes for the desired polypeptide into a vector (as herein defined) at an appropriate 
insertion site so that a vector (conveniently in the form of a replicable plasmidic expression vehicle) is obtained 
which is capable of directing synthesis of the polypeptide. 

According to a further aspect of the present invention there is provided a vector which comprises an induc- 
ible selection gene. The selection gene may be as defined hereinbefore, for example it may comprise the tetA 
and tetR genes. 
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larlv adtaSnl? the Pres f t nt inv « ntion utilises an '"duclble selection gene. This has been found to be particu- 

- m wmsmm 

J S.'SSr ™ V,sST °' ""' PreSS "' i nVen '""' ' S e " em l" ffiM <* >"=">°« «*** carries the mamr 
T " aS une *e e < :te <% 'ound to have gained stability over lie parent r>AT153~even 

20 Pl ™rztn7iCnwh^ 

Figure 1 illustrates transcription terminator sequences; 

Figure 2 illustrates the preparation of pTB344; 

Figure 3 illustrates the preparation of plCI0042; 
25 Figure 4 is a plasmid map of plCI0042; 

Figure 5 describes fragments used in the preparation of plasmids- 

Figure 6 is a plasmid map of plCI1079; 

Figure 7 is a plasmid map of pLB015; ' 

Figure 8 is a plasmid map of pCGG1; 
30 Figure 9 describes the sequence of [Ser 17 . 27 ]G-CSF; 

Figure 10 describes the sequence of h-GCSF; 

Figure 1 1 is a plasmid map of pCG54; 

Figure 12 illustrates the construction of plCI0020; 

Figure 13 illustrates the construction of plCI1078; 
35 Figure 14 illustrates the construction of pICM 1 02- 

K ~mSSra?2^ 9d " E " CO,i ' ySateS " A iS P ' CI1102: B > 

Figure 16 illustrates a gel profile of pIC1 11 02 in which peak R represents ricin A 

2^7* 1 ,S 3 WeSt6m b ' 0t ° f riCi " A PrOdU0ed by p,CM 102 and 'nwhichtracklismolecularweightmankers- 
2 and 3 are non-r.cn producing clones; 4 is p.CM 102; and 5 is p IC .0020 (control plasmid " 20^ seq ^ 

Figure 18 is a partial sequence of plCI1102; and 
Figure 19 illustrates the contraction of plCI1187 

BUFFERS FOR RESTRICTION ENZYMES 

Stability: stable at -20°C. 
so Buffer composition: 



55 
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Buffer components Final concentration in mmol/1 





(1:10 diluted 


set 


buffer) 




B 


M 


H 


Tris-HCl 


10 


10 


50 


MgCl 2 


5 


10 


10 


NaCl 


100 


50 


100 


Dithioerythritol (DTE) 




1 


1 


2-Mercaptoethanol 


1 






pH at 37°C 


8.0 


7. 


5 7.5 



15 

The above buffers are available from Boehringer Mannheim. 
In the site-directed mutagenesis procedure - Example 7 
Buffer 1 1 00 mM Tris HCI pH 8.0 
20 100mMNaCI 
20 mM MgCI 2 
Buffer 2 10 mM Tris HCI pH 8.0 
20 mM NaCl 
1 mM EDTA 
25 Buffer 3 12 mM Tris HCI pH 7.7 
30 mM NaCl 
10 mM MgCI 2 
8 mM 2-mercapto ethanol 
Buffer 4 60 mM Tris HCI pH 8.0 
30 90 mM NaCl 

6 mM MgCI 2 
10 mM DTT 

Nucleotide mix 1 250 nM each of dATP, dGTP, dCTP=S (phosphorothioate derivative of dCTP), dTTP and 1 
mM ATP 

35 Nucleotide mix 2 250 uM each of dATP, dGTP, dCTP, dTTP and 350 nM ATP 

M9 minimal media 

40 Ammonium chloride lg 

Disodium hydrogen or thophosphate 6g 

Potassium dihydrogen or thophosphate 3g 

Sodium chloride 0.5g 

In distilled water 1 1. 

Supplements/75ml 

so 300 \i\ 50% glucose 

75^1 1MMgS0 4 

75pil 0.1MCaCI 2 

75 uJ 4 mg/ml thiamine 

75 |il 20% casein amino acids 

55 
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Trace Element Solution (TES) 

TES has the following composition: 
mg/10 ml 
deionised water 



10 



A1C1 3 .6H 2 0 2.0 

CoCl 2 .6H 2 0 0.8 

KCr(S0 4 ) 2 .12H 2 0 0.2 

CuCl 2 .2H 2 0 0.2 

,5 H 3 B0 3 0.1 

KI 2.0 

MnS0 4 .H 2 0 2.0 

20 NiS0 4 .6H 2 0 0.09 

Na 2 MoO A .2H 2 0 0.4 

ZnS0 4 .7H 2 0 o.A 

25 Geneclean (TM) 



55 



inn J!L kit ""l?'"! 1 ? ^ SOd, ' Um i0dide 2) 3 "'ncentrated solution of sodium chloride. Tris and EDTA for mak- 

7slT:ZT:^T anol,water wash: 3) G,assmilk (TM) - a 1 - 5 ml vial contai ^ 1 25 - <* • 

30 ce J^ i of a ^ C !! | n ? Ue ?I °T purification based on the m *hod of Vogelstein and Gillespie published in Pro- 
ceedings of the National Academy of Sciences USA (1979) Vol 76 p 615 

sJS^T ?w 6 meth ° dS deSCnbed in " Molecu,ar Caning - a laboratory manual" Second Edition 
Sambrook, Fntsch and Maniatis (Cold Spring Harbor Laboratory, 1989) can be used. 

35 Random Label Kit Product of Pharmacia No 27-9250 

PriJJ 6 T^""* iS deSCribed in Molecular Cloning - a Laboratory Manual" Second Edition Sambrook 
Fntsch and Maniatis, PP 10.13-10.17 (Published by Cold Spring Harbor Laboratory 1989) Samb ™^ 

40 Sequenase (TM) 

Chemically modified T7 DNA polymerase 

. JIZZSXEZSZ tZSSZX"** 0 " pub " shod " " ProcMdln9s NMtonal *— - 

T4 DNA ligase 

so 5 60 D «f 2f m.Vr l™?^,?*™ 9 ' 3 Laborator y Manual " Second Edition, Sambrook, Fritsch and Maniatis 
^3 ^541(1968) " P 9 Harb ° r Lab ° rat0ry 1989> 3lSO by W6iSS B • * * J - Bio '- Chem ^ 

E.coli strains 



tnr J he ^ strains HB101 a nd CGSC 6300 (also referred to herein as MSD522) are freely available Thus 

all HB1 01 may addrt.onally be obtained from for example BRL supplied by GIBCO Limited Unit 4 Co vTv Mm 
Trad.ng Estate. Longbridge Way, Uxbridge. UB8 2YG, Middlesex, England or GIBC^ Laboratories, Life 
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Technologies Inc., 31 75 Staley Road, Grand Island, NY 14072, USA. The genotype of strain HB1 01 is described 
in the aforementioned "Molecular Cloning - A Laboratory Manual" as Sup E44 hsd S20 (r B ~ m B ")rec A 13 ara-14 
Fleu 6 thi-1 proA2 lac Y1 gal K2 rps L20 xy!"5 mtl"1. The genotype of MSD 522 (CGSC 6300) is set out in 
Example 3. 

5 The following non-limiting Examples are given by way of illustration only. 

EXAMPLE 1 

Preparation of plasmids containing trp promoter/ tetA/tetR genes. 

10 

(a) Preparation of plCI0042 

Many plasmid vectors are based on one of the original cloning vectors: pBR322 (Bolivar et al , 1977, Gene 
2: 95-1 13). The non-mobilizable pAT153 is a derivative of this (Twigg and Sherratt, 1980, Nature 283 : 216-218). 
15 Both these plasmids contain the ampicillin resistance determinant, TEM p-lactamase. 

Plasmid plCI0042 utilises a repressed tetracycline resistance determinant, as found on the naturally-occur- 
ing plasmid RP4. This repressed system shuts off expression of the tetA gene in the absence of tetracycline 
whereas most drug resistant mechanisms have constitutive expression. 

The tet locus was first mapped on RP4 by Barth and Grinter (J.Mol. Biol.113 : 455-474, 1977). This was 
20 shown to consist of adjacent genes: tetA , the structural resistance gene and tetR , the repressor gene and this 
region has been sequenced (Klock etal , J. Bacterid : 161 :326-332, 1985). These genes are located on adjacent 
Bglll-Smal and Smal-Smal fragments. The Bglll site is unique in RP4 but there are five Smal sites (Lanka, Lurz 
and Furste, Plasmid 10 : 303-307, 1983). 

25 (i) Cloning the tetA + tetR genes 

The plasmid RP4 is well documented (Datta et al , J. Bacteriol 108 : 1244, 1971) and is freely available. Fur- 
thermore, the plasmid RP4 has been deposited with the National Collection of Type Cultures, 61 Colindale 
Avenue, London, NW9 5HT under accession numbers 50078 and 50437. RP4 obtained from N Datta (National 

30 Collection of Type Cultures) was used herein. E. coli strains containing this plasmid were grown in selective 
broth cultures and plasmid DNA was isolated by a scale-up of the Holmes and Quigley method (Holmes and 
Quigley, Anal. Biochem 1 14: 193-197, 1981). It was deproteinized by treatment with 2.5M ammonium acetate 
and reprecipitated with isopropanol. This plasmid DNA was treated, according to the supplier's recommended 
conditions, with restriction enzyme Bglll and cut to completion. It was then partially cut by Xmal by using diluted 

35 enzyme and short incubation times. Xmal is an isoschizomer of Smal but which produces 4-nucleotide cohesive 
ends at its cut sites. 

The vector plasmid pUC8 (Yanisch-Perron, Vieira and Messing, Gene 33 : 103-119, 1985) was similarly 
prepared and cut with BamHI and Xmal to completion. The RP4 fragments were cloned into this vector by liga- 
tion with T4 ligase at 1 2°C for 1 6 hours. This was used to transform E. co[i C600 made competent by the calcium 

40 chloride method (Maniatis et al , Cold Spring Harbor Laboratory, 1 982). Cultures were then plated onto medium 
which selected for tetracycline resistance. 

E.coli C600 is freely available from numerous sources including many culture collections such as the E.coli 
Genetic Stock Centre, Yale University, USA under accession number GCSC 3004. The genotype of E.coli C600 
is K12 thr-1 leuB6 thi-1 lacY 1 tonA2 1 X~ supE4 4. 

45 Several colonies with this resistance were checked for the expected phenotype (ampicillin and tetracycline 

resistance but not the kanamycin resistance indicative of RP4 itself). Colonies with the correct resistances were 
subjected to clone analysis by isolating plasmid DNA (Holmes and Quigley method). These preparations were 
cut with EcoRI and HindMI and analysed by gel electrophoresis. This established the size of the cloned insert 
which was found to be the 2.45 kb predicted for the Bglll - Xmal - Xmal fragment from RP4. A clone carrying 

so this fragment containing the tetA and tetR genes was designated pTB344. (Figure 2) 

(ii) Removal of the tet gene from pAT153 

It was necessary to remove the tet gene from the vector plasmid pAT153 before inserting the tetA + tetR 
55 cassette from RP4 to prevent gene duplication which can be a source of genetic instability. Also the tet gene 
may not be effectively suppressed by the non-cognate tetR . The removal was done by isolating plasmid pAT153 
DNA and cutting it with EcoRI and Aval . Between these sites, synthetic oligonucleotides with the following sequ- 
ence (SEQ IN NO. 40):- 

7 
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5 ' AATTCGCATGCGGATCCATCGATC3 ' 

3 ' GCGTACGCCTAGGTAGCTAGAGCC5 ' 

5 were cloned. These fit the EcoRI and Aval cohesive ends and contain Sphl, BamHI and Clal sites in addition 

Hatm^D^^ 
wasTsi™ 

10 ("') Insertion of the tetA ♦ tetR genes 

. f The and J** 9enes were isolated from pTB344 on an EcoRI to Pstl fragment. The pUC8 vector was 

255!? 5 H n9 ,^ h b6CaUSe " canrt " the same se ^°" n deleTminant (ampic!..* , 

P ICI0019 Plasmid plCI0019 DNA was cut with EcoRI and PsH and then ligated with the 2 45kb7raqmenTcar 

2H? ?• ~ 9en ? S - ThiS W3S US6d 10 tranSfomi i=S9Q C600. the culture being p.ated oui under XtilT 

£™I , Wmchsho ,dtn i ls,ose,team P icil,in stance determinant. Loss of ampicillin resistance frornThLa 
ana^rl 00 J™ 16 * 1 - A / ew done s were then used to isolate plasmid DNA which was subjected to resection 

20 (Figu^) S ^ ^ ^ C ° nStmCted P ' aSmid h3d the intended Structure " 11 — -UnliTSS 

(<v) Insertion of the cer sequence 

25 and D ^S!2l2 ! ? rtn8 P,aSm ' H d ^ iS ver V stab| y maintained in Exoli, whereas its derivatives P BR322 
25 and pAT153 are not. Summers and Sherratt( Cell, 36 : 1097-1103, 1 984) demonstrated that this was due to the 
denvat,ves not containing a short (283 bp) sequence called cer which is present in the paSasmid Thfe 
sequence conta.ns a site-specific plasmid multimer-resolution^stem which prevents the ai^'aZllas 
T 0 ?«T eT tT ed b y homolo 9° us ^combination. Such mu.timers have I 'deleterious e^ect o *e prcces" 

so Z n ° m f^ enSUreS Stab ' e inheritanCe ° f dau 9 hter P |asmids d """9 bacterial cell dlision 

h I k ^ e set ' uence (Summers.D et al MGG, 201, P334-338, 1 985) was isolated from plasmid pKS492 (oro- 

ToV?< T att> 85 3 289 bP fra9ment by CUtting with and Ii3i- The plasmTd pTB™5l was isolated 

as DNA from a distrain of BcoN to prevent its Oal site being blocked^ the dam+ methyla«on system T S 

foThisc.ontoTThf? n T and ~ (b f h th6Se Sit6S haVi " 9 bee " introdu - d ^ synZZZ^eJme 
« [f/n k 9 2' T , he ^ fra 9 ment was "gated with the cut vectorand then used to transform E coli C600 select 

35 toon bemg made or tetracycline resistance. Transformant colonies were subjected to donT^lysifby AvS 

restnctoon and gel electrophoresis. The presence of an extra DNA band of about 300 bp indicTte^racq^sS 

I£ -^ 9m ^- FUrther reStnCti ° n analyS6S were used to «"""" tnat ^"'tant plasmas ha^ th c^L* 
structure. One of these was designated plCI0042 (Figure 3 and 4). 

«> (v) Tests on plCI0042 

Thi induS^^H 61 ! ? mP '! t Sequenced usin 9 tne DuP °nt Genesis 2000 machine. (Figure 3 and 4). 
of ^S^t^SSST reS ' S,anCe ; a r S Checked - First *• minim- inhibitory concentration (M,C) 
or tetracycline for (plCI0042)C600 was measured. This was done by inducing a culture of this strain with 0 4 

SST^Xc to mSSSS to T about 100 colonies per plate - The M,c was 

line This was us Ji * T 1 ?? T , 9r ° Wn tQ ^ '° 9 PhaSe ,n Luria broth in the absence <* tetracyc- 
™ th f ' noculate P arallel cultures 'n broth, with and without tetracycline induction These were 
grown w,th aerafon at 37°C for 90 minutes to allow induced expression. Serial dilutions of these cultures ZeZ 

ZS n ££SE ^t7? n( l With ° Ut 100 fi9/m ' ^e resulting c^ZSZS 

mat the viab.l.ty of the uninduced culture was 1600-fold lower than the induced culture on the tetracvcline 

xlln^^.^ !*A + tetR induction system in p.C.0042 is working satisfcctorij, " 
The maintenance stability of pTB351 and plCI0042 in E. coli C600 were checked. This was i" order to com 
ss T?" 6 " 1 P ' aSmid PAT1 53 and n ° te the SSE sequence in P SoM2 Cut es we" 

slrr,. ♦ T P ' aSmid ,OSS bY 6ither Strai " tnrou 9hout this period. Thus pTB351, even withoufa cer 

of w .I' 3 " 6 StabNity ° Ver itS Parent PAT153 " This mav be a consequence o«»2iS 
of ,ts M gene. Naturally-occunng piasmids always have tetracycline resistance under inducible confrol so'he 
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constitutive tet gene in pAT153 may be counterselective in the absence of tetracycline. This could be due to 
the fact that the tetracycline resistance mechanism acts as a cytoplasmic membrane export pump. When not 
required it may harm the cell by damaging the membrane structure, exporting wanted metabolites or wasting 
metabolic energy. The presence of the cer sequence in plCI0042 should contribute to plasmid maintenance 
stability even under the counterselective conditions of using it to express a recombinant gene at high level. 

(b) Preparation of Plasmid pCH101 

The plasmid pCH1 01 corresponds to pICI 0020 (see Example 5c) except that the EcoRI-Sall fragment (see 
Figure 5a) is replaced by a fragment consisting of the SEQ ID No 34 (see Figure 5b also) and the interferon 
a 2 gene sequence as described by Edge M.D. eta[, Nucleic Acids Research 1983, Vol11, p6419-6435. In this 
regard the 3'-terminal ATG codon of SEQ ID No 34 immediately precedes the TGT codon which codes for cys- 
teine (amino acid 1) in the interferon a 2 sequence of the above-mentioned Edge M.D. et a[ Nucleic Acids 
Research reference. The 5' nucleotide sequence GATCCATG and the complementary 3' nucleotide sequence 
GTAC are thus omitted from the nucleotide sequence of the aforementioned reference. 

(c) Insertion of an Expression Cassette into pi CI 0042 

An expression cassette consisting of the trj> promoter, a ribosome binding site and the interferon ct 2 gene 
was isolated from plasmid pCH101 (see b above) on an EcoRI to Sphl restriction fragment. This was ligated 
into the production vector (plCI0042) (see above) similarly cut with EcoRI and Sphl . This DNA was used to 
transform a competent culture of E. coli C600 and tetracycline resistant colonies were isolated. A few of these 
were tested by DNA clone analysis for the acquisition of the Sstl restriction site carried on the expression cas- 
sette. Clones positive in this respect were further tested by restriction mapping to check that the expected con- 
struct was correct. They were also checked for the conferred capacity to produce interferon a 2 protein as 
analysed on a polyacrylamide-SDS gel stained with Coomassie blue. One such confirmed clone was desig- 
nated pLB005. 

(d) Insertion of T4 transcription terminator into pTB244 

The T4 transcription terminator sequence in the form of the Sail to Hindlll fragment (67 bases pairs long) 
(see SEQ ID No. 33 and Figure 1b) was inserted into the multicloning site of an intermediate vector pTB244 
(described in European Patent Publication No. 237,269) between its Sail and Hindlll sites. Clone analysis was 
used to confirm the structure of this construct (pTB244-T4 ter). From this vector, an Sstl to Sphl fragment con- 
taining most of the multicloning site and the T4 terminator was then isolated. This was inserted into pLBOOS 
similarly cut with Sstl and Sphl thereby substituting the interferon a 2 gene but leaving a cassette consisting of 
the tri> promoter, multicloning site and T4 terminator. This construct was confirmed by done analysis and the 
plasmid designated pLB013. 

(e) Substitution of the multicloning site 

The multicloning site in pLB013 is not ideal for this vector in several respects: the Sail BamHI and Smal 
sites are not unique but exist elsewhere on the plasmid. This fragment was therefore excised by cutting with 
Sstl and Xbal (both unique) and synthetic oligonucleotides with the sequence of SEQ ID No. 35:- 



were inserted in its place. Clones were analysed for acquisition of the new restriction sites and then confirmed 
by sequencing. One such plasmid was designated pLB014. The new cloning sites inserted in this way are: Ndel, 
Kpnl, Bgfll , Xhol and Seal with the previous Xbal and SaU following them. 

(f) Further modification 

It was discovered that the adjacent Sstl and Ndel sites in pLB014 could not be cut by both these restriction 
enzymes either simultaneously or sequentially presumably because of their close proximity. An additional sequ- 
ence was therefore inserted between them. This was done by cutting pLB01 4 with Sstl and Kpnl and then insert- 



AGCTCCATATGGTACCAGATCTCTCGAGAGTACTT 



GGTATACCATGGTCTAGAGAGCTCTCATGAAGATC 5 ' 
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ing the synthetic oligonucleotide of SEQ ID No. 36:- 

5 ' AGCTCAGCTGCAGCATATGGTAC 
GTCGACGTCGTATAC 5' 

EXAMPLE 2 

Preparation of [A^i.Ser^^human g_ csf usj ^ 

pJJSTS'S? Z2Z£££T*2 was d h i9ested with EcoR ' and Sa " in buffer H as described 

[AraiSvI'^r, "g c5t» ? ! "ntamna the trp promoter, ribosome binding site and gene for 
Z nt i h " G - CSFwasisolatedfrom a0.7%agarosegelbyuseofGeneclean(TM) A vector frao 



30 b) Fermentation 



*^Z™*Tc£7&T MSD 522 M *• — « »— *— ■»— - — 
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Composition of Growth Medium 

Made up of distilled water 



g/1 

KH 2 P0 A 3.0 

Na 2 HP0 A 6.0 

NaCl 0.5 

Casein hydrolysate (Oxoid L41) 2.0 

(NH A ) 2 S0 4 10.00 

Yeast Extract (Difco) 10.00 

Glycerol 35.00 

L-Leucine 0.625 

MgS0 4 . 7H 2 0 0.5 

CaCl 2 . 2H 2 0 0.03 

Thiamine 0.008 

FeSO^/Citric Acid 0.04/0.02 

Trace element solution (TES) 0.5ml l" 1 

Tetracycline lOmg I" 1 



Fermentations were then carried out at a temperature of 37°C, and at a pH, controlled by automatic addition 
of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air-satu- 
ration and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fer- 
menter, initially 20L/min, corresponding to 1 volume per volume per minute (WM) was increased to SOL/min 
(2.5 WM) when the fermenter stirrer speed approached 80-90% of its maximum. Since the oxygen transfer 
rate (OTR) of the fermenters was unable to meet the oxygen uptake rate (OUR) of the bacteria at a cell density 
greater than that corresponding to an OD^ of 50 under the conditions described, dOT in the fermenter at cell 
densities greater than this was maintained at 50% air-saturation by restricting bacteria oxygen uptake rate. This 
was achieved by formulating the medium to become carbon-limited at OD550 of 50 and then supplying a feed 
of the limiting carbon source, together with ammonium sulphate and yeast extract, at a rate which restricted 
bacterial growth rate. 

Fermentations were performed for 1 8h and during that time samples were taken for measurement of optical 
density (OD550), cell dry weight and accumulation of [Arg^.Ser^^.^human G-CSF within the cells. 
[Arg^Ser^^^human G-CSF accumulation was measured by scanning Coomassie blue stained SDS-PA- 
GE gels of whole cell lysates of the sampled bacteria as is well known in the art 

When OD550 reached 35 (8.5h), casein hydrolysate solution (100g/1 Oxzoid L41) was pumped into the fer- 
menters at a rate of 0.75g/l/h. 

When OD550 reached approximately 50, the supply of carbon-source in the fermentation batch became 
exhausted leading to a rapid rise in dOT from 50% air saturation. At this point, a feed containing glycerol 
(470g/1), yeast extract (118g/1) and ammonium sulphate (118g/1) was pumped into the fermenters at a rate 
which returned and then maintained the dOT at 50% air saturation with the fermenter stirrer at ca 70-80% of 
its maximum. Casein hydrolysate feeding was maintained at 0.75g/l/h throughout. After approximately 18 hours, 
when microscopic examination of the culture showed the presence of large inclusion bodies within a majority 
of the cells, bacteria were harvested on a Sorval RC3B centrifuge (7000g, 30 min., 4°C) and stored frozen at 
minus 80°C. 

EXAMPLE 3 

Preparation of [Arg^.Ser 17 - 27 ' 60 . 65 ] human G-CSF using vector including T7A3 promoter 

a) An EcoRI-Sall fragment, containing a T7A3 promoter, a trp leader ribosome binding site sequence and 
a gene for [Ser 17 27 ]hu G-CSF was sub-cloned into M13 mp18 as described in part d) of Example 5. The sequ- 
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ence of the EcoRI-Sall fragment is set out in SEQ ID No 32 and Fiqure 9 SEQ ID No rnncict* fh ■= D . 
resection site (nucleotides 1-6), the A3 promoter sequence of SS^^^^S^^^ 

codes for threonine (amino acid 1) in Figure 9. The 5' nucleotide sequence aSJt^^^^S2 

SiS^liTS mt ECOR ' • Sa " ^ ^ a,S ° 66 ^ » ^£S£SZ £ 
SscTbed?n S^Z 7 mUta , geneSl f W8S Perf0rmed ° n sin 9 le - s ^nded DNA as described in the protocol 
in nl m t ? a ^ P i S 7 US,n9 0,l 9<>n"cleotide SEQ ID No 28 to convert the codon for Gin at position 1 I to Ara 
10 Double-stranded RF DNA was prepared from a plaque containing the Glnii^ii^J^SJi^ 

d^cTbe, ;r SPt IT 1 T 83 inCUbati ° n WaS f ° r 3 h ° urs instead of 5 »°™> ^ digested with Ec^RI (as 
descnbed prev.ously) and SnaBI (10 units, 1xM buffer, BCL, 30ul, 2 hours 37°C> TheLZT^ c 

« I X £ Te l™ ,SO ' ated and M9ated t0 a " EcoR| -SnaBI cut vector from pIC1 1 327 (which contains3ons 
^ t T f iS deSCr,b6d in Examp,e 2) - The M 9 ation mix wa * "^d to transform £ 001^^0521 
and ransformants selected for growth on L-agar plates containing tetracycline (IS^g/nTg) "SasmTd DNA fr™ 
a colony contain.ng the expected T7A3 promoter and [Argn SeriA27.eo.es! hu Q-CSF aen^n InT ° NA .™ m 
tified by sequencing DNA from the isolated p.asmid and designated p.C. 1 3^6 ' ' ^ lde "" 

20 effected 'TtlTZ^V h aCCOrd ; n9 t0 *° alt6matiVe PrOC6SSes < b > and < c > be,ow - <"> was 

[Ami 727^ 1 5 h ° UrS fermentati0 " ^ described, microbial biomass was 35 g/l and 

Iffected " 3o"c3dT a l f T 6Stimated 10 be accu ™ lated to 79/1 fermentation broth. Process^) was 
WKh ™2h f fermentation was accordingly slower because of the lower fermentation temperature 

G CSFuaW P^s(c) after 35 hours, the microbial biomass was 55 g/l and the [Arg-Seri^San' 
G-CSF yield was est.mated to be accumulated to 15 g/l fermentation broth 

25 ££3* ai ," CG 5 C ,^ 00 < genot yP e F "^' obtained from the E.coii Genetic Stock Centre was trans 

^^ri^ , ^^^^' Wn CGSC 6300 (P ' Cll3T6 > was « an^^ainedT 
glycerol s ocks at -80 C. An aliquot of the culture was removed from stock and streaked onto aqar elates of 
L-tetracyclme to separate single colonies after overnight growth (1 6h) at 37°C 

StU^T 9 T ' 31 37 ° C ° n 3 reci P rocati "9 shake ' contents of the I^^^^SSZ 

iz?r;srr containins 20 li,res of modffied lcmso 9rowth medium - The ™ 
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TABLE 1: Composition of growth medium 



Modified LCM50 Growth Medium (A) 

made up with distilled water 

g/i 



10 


KH PO 

2 4 


3.0 




Na2HP0^ 


o • u 




NaCl 


0.5 


15 


Casein Hydrolysate (Oxoid L41) 


2.0 


(NH 4 ) 2 S0 4 


10.0 




Yeast extract (Difco) 


20.0 




Glycerol 


35.0 


20 


MgS0 4 .7H 2 0 


0.5 




CaCl 2 .2H 2 0 


0.03 




Thiamine 


0.008 


25 


FeS0,/Citric acid 
A 


0. 0A/0. 02 




Trace element solut ion(TES) 


(0.5ml l" 1 ) 




Tetracycline 


(10 mg I" 1 ) 



30 The fermentation was then carried out at a temperature of 37°C and at a pH, controlled by automatic addi- 

tion of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air 
saturation and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the 
fermenter was initially 20 L/min corresponding to 1.0 volume volume per minute (WM) and was increased to 
45 L/min manually when the fermenter stirrer speed reached its maximum (1000 rpm). The fermentation was 

35 performed for 1 6h and during that time samples were taken for measurement of optical density of the culture 
(OD 550 biomass concentration, total microbial protein concentration and accumulation of [A rg 11 ,Ser 1 7 ' 27 - 60 - 65 ] hu- 
man G-CSF within the bacterial cells. Accummulation was measured by scanning Coomassie blue stained 
SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the art. Total microbial protein 
was estimated by the method of Lowry. A solution of yeast extract (225 g/L) was pumped into the fermenter 

40 4.5h post inoculation at 1.7 g/L/h. When the supply of carbon source (glycerol) in the growth medium became 
exhausted dOT increased rapidly from 50% air saturation. At this point a feed containing glycerol (714 g/l) and 
ammonium sulphate (143 g/L) was pumped. Since the bacterial oxygen sulphate rate (OUR) approached the 
maximum oxygen transfer rate of the fermenter (OTR) just prior to the carbon source in the batch growth medium 
becoming exhausted, the feed was pumped into the fermenter at a rate which restricted the bacterial OUR to 

45 approximately 80-90% of the fermenters maximum OTR. The feed rate was adjusted manually to return and 
then maintain dOT at 50% air saturation under the conditions described. 

c) The fermentation process described in (b) was repeated but at a temperature of 30°C for 35 hours. Except 
for the fermentation temperature of 30°C the medium and fermentation conditions were identical to those des- 
cribed in (b). 

50 

EXAMPLE 4 

Preparation of plasmid pAG88 
55 a) Preparation of a synthetic gene for human G-CSF 



A DNA sequence (Figure 10) encoding the amino-acid sequence of the polypeptide of Figure 10 (human 
G-CSF) was designed according to the following considerations: 
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1) Single - stranded cohesive termini to allow ligation at suitable sites in a plasmid 

manipulation^ reStriCti °" endonuc,ease sequences throughout the gene to facilitate subsequent genetic 

3) Translation termination codon. 

In^h?! 31 the f/- end of ] he codi "9 re 9 ion were n °rmally chosen to be A/T rich. Other codons were nor- 
mally chosen as those preferred for expression in E.coli. 

The gene was assembled from the 18 oligonucleotides designated SEQ IDNo.1 -SEQ ID No.18 and shown 



hereinafter. 



io Preparation of Oligonucleotides 

« SU nthl!r c < !lH 0nU( i?!. tide S u equences snown hereinafter were prepared on an Applied Biosystems 380A DNA 
synthesiser from 5<-d,methoxytrityl base-protected nudeoside-2-cyanoethyl-N,N-diisopropyUosphoramidites 

is tocols supplied by Applied Biosystems Inc. «" pro- 

son andTrSh' ^ t I? | ° li 9° n y cl f « d « sequences may be prepared by manual methods as described by Atkin- 
S!S^"pi5SSp Synth6SiS ' 3 ^ APPr ° aCh ' < M - T " Gait ' ^ — Oxford, 

20 syntLTser w^ * ™ " "» biosystems 380A DNA 

^Jt? h n 0,i9 ° nu ,? e ° tide ' after cleavaae from the solid support and removal of all protecting groups, was dis- 

S£? ? "ST A SOlUti ° n ° f 3M S ° dium aCetate (pH5 6: 4( W and etha "°« dmO was added to the 
ol igonucleotide soUitions (400^) and the mixtures stored at -70°C for 20 hours. The resulting precipitates were 

25 Sen dr^d SS " 9ati ° n ( H 3 J 00,pm for 1 ° minUt6S) and the Pe " ets washed wrth «h^SiioSp!Io 
EDTA, 0.01% Bromophenol Blue, 0.01 % xylene cyanol. 80% formamide 

« 0li9 ° nudeo i tides were P uri fied °n a 10% polyacrylamide gel in 50mM Tris-borate (pH8.3) containina 

8.3M urea Ohgonucleotides of correct length were identified by UV shadowing (Narang et a 1979 i^Zs 
!"! ym ° , ° 9y V °' 68, 90 " 98) - norma,| y the most Pr^inent band - excised from the gel and elecfroelute^n 

^lent wTn ^ /* 3 °° mV ^ hOUrS " The aqUe ° US SOlUti ° nS Were «ncJL^E^£ 
treatment with n-butanol (m,x, spm and removal of the upper organic layer). The purified oligonucleotides were 
preciprtated at -70°C for 20 hours from a 0.3M sodium acetate solution by addition of ethanoK2 Tvdumesl 

Assembly of gene 

is 

.ated^rT^ 68 . 8 ^ "J N ° 2 " SEQ ' D N ° 1 7 (40 ° PM ° f each > I as defined hereinafter] were phosphory- 
lated with T4 polynucleotide kinase (3.6 units) for 2 hours at 37»C in 25uj of a solution conta ning ATP (800pM 

?DTA n '" 9 ^ 5pM , 9amma - " P ATP) ' 1 °°* M s P«r-idine, 20mM MgCI 2 , 50mM Tris-HC. (pH9.0) and 0 1mM 

o sho^ il Tah^ 7* h f at6d f 1 °°° C f ° r 5 m,nUteS tQ terminate the reactions - 'he' -mixed in pairs as 
o shown '" Table 1 to give duplexes A to I (Oligonucleotides SEQ ID No 1 and SEQ ID No 18 (400mM in 25ul) 
were used unphosphory.ated). 0.3M Sodium acetate (pH5.6, 200ul) and ethano. (850^1) were added Ind tht 
P t h e T tete , d 31 " 2 °° C f ° r 20 hOUrS " ThS r6SUltin9 Precipitates were collected by cen^gatfon ^d 

l^tTr^ r1™T <7 i 3) then diSSO ' Ved " Wat6r (5 °^ The Pairs ° f oligonucleotides wereYnnea^ 
together by first heating the solut.ons to 100°C for2 minutes in a boiling water bath. The bath was then allowed 

?r i r ,t ° 4 °° C <ab ° Ut 4 hOUrS) - S ° lutions containin9 3 P a "* of duplexes were combined ^"0^ 

BRL), 50mM Tris (pH7.6), 10mM magnesium chloride, 5% (w/v) PEG 8000, 1 mm ATP 1mm DTT <BRL Focu* 
Vol 8 no 1 Winter 1986) and the DNA ligated at 30=0 for 5 minutes fo.lowed by 20 hou^ ?6X IS So^m 

> s^SL <iM a added and the product precipitated 

T^lli w . VT?: The prec,Dltate was col| ected by centrifugation and washed with elhanol 1ml) 

jZ^eMW mM EDTA ndf0 HT, a M ide/dy r " PUrifiSd ° n 3 1 ° % Po'V-rylamidege. in 50mM 

k o f h u " d 8 3M Urea " BandS f ° r Strands of a PPropriate lengths (173-186 bases) were 
identified by autorad,ogra P hy and isolated together by electrocution from a single gel slice as described atove 

touni; at loo c for 2 minutes, then allowing it to cool to 40°C over 4 hours 

„ m H Gr ?T " 3nd '" W6re ' igated t09ether essential| y as described for the group preparation to give as the 
00?^'. o?H 9 T 6 SeqUe r Sh ° Wn in Fl ' 9Ure 1 °- Aft6r P reci P^ion, the gene wL phospho^ted wrth T4 
polynucleotide kinase as described previously for individual oligonucleotides, then dissolved in water 
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TABLE 1 



DUPLEX 



OLIGONUCLEOTIDE 



NUMBER OF BASES IN 
TOP STRAND BOTTOM STRAND 



10 



15 



20 



25 



A 


SEQ 


ID No 


1 + 


SEQ ID No 2 


62 


64 


B 


SEQ 


ID No 


3 + 


SEQ ID No 4 


60 


60 


C 


SEQ 


ID No 


5 + 


SEQ ID No 6 


48 


51 


D 


SEQ 


ID No 


7 + 


SEQ ID No 8 


63 


60 


E 


SEQ 


ID No 


9 + 


SEQ ID No 10 


63 


63 


F 


SEQ 


ID No 


11 


+ SEQ ID No 12 


60 


63 


G 


SEQ 


ID No 


13 


+ SEQ ID No 14 


63 


60 


H 


SEQ 


ID No 


15 


+ SEQ ID No 16 


60 


60 


I 


SEQ 


ID No 


17 


+ SEQ ID No 18 


55 


53 


I 


A + 


B + C 






170 


175 


II 


D + 


E + F 






186 


186 


III 


G + 


H + I 






178 


173 



b) Cloning of the synthetic gene for human G-CSF 

30 The synthetic gene described above, was cloned into the plasmid vector, pSTP1 (Windass et al, Nucleic 

Acids Research, 1983, Vol 10, p6639). 

For vector preparation, 10ng of STP1 was dissolved in water (37.5u.l) and 10 x B restriction buffer (4.5uJ) 
(BCL). the restriction endonuclease Sail (3ul) (BCL, 8 units/^il) was added and the mixture incubated at 37°C 
for 1 hour until linearised plasmid was predominant over supercoiled and nicked circular forms. The DNA was 

35 precipitated with ethanol at 4°C for 30 minutes, washed with ethanolrwater (7:3) then dissolved in water (39.5uJ), 
1 0X H buffer (4.5^) (BCL). The restriction endonuclease EcoRI (1 (BCL, 90 units/^il) was added and the mixt- 
ure incubated at 37°C for 1 hour until the large EcoRI-Sall fragment was predominant. The DNA was precipi- 
tated at -20°C for 20 hours, washed with ethanokwater (7:3) then dissolved in water (20u.l) 

The large EcoRI - Sail fragment was purified on a 1% preparative agarose gel and electroeluted and pre- 

40 cipitated as described previously, then dissolved in water (20^1). For ligation of the synthetic gene, a mixture 
of vector DNA (2uJ of the EcoRI - Sail fragment solution), synthetic gene (5uJ of the aqueous solution described 
previously, 5X ligase buffer (6^ -250mM Tris pH7.6 50mM MgCI 2 , 25% W/V PEG8000, 5 MM ATP, 5mM DTT 
exBRL) water (1 5^1) and T4 DNA ligase (2uJ, 1 U/\x\) was incubated at 16°C for 4 hours. The DNA mix was used 
directly (either 1uJ of neat ligation mix or 2^il of ligation mix diluted 5X with water) to transform E. coli strain 

45 HB101. The DNA mixture (1 or 2uJ) was added to competent E. coli HB101 cells (20jil, BRL) on ice and the 
mixture incubated on ice for 45 min then heat shocked at 42°C for 45 seconds. After 2 min on ice, 100jil of 
SOC buffer (Bactotryptone 2%; Yeast Extract 0.5%; NaCI 10mM; KCI 2.5mm; MgCI 2 , MgS0 4 20mm (10mm 
each); glucose 20mm) was added and the mixture incubated at 37°c for 1 hour. Aliquots of suspensions were 
plated onto L plates with 50^l/ml ampicillin. Transfonmants were screened for the presence of cloned synthetic 

so gene by colony hybridisation analysis using standard methods described in "Molecular Cloning: A Laboratory 
Manual" by Maniatis etal (Cold Spring Harbor) and in UK Patent Application No 8502605. A total of 100 colonies 
were streaked onto filters (Schleicher and Schuell), grown at 37°C for 20 hours, lysed and baked. The filter 
was hybridised at 65°C for 20 hours with a radioactive probe prepared from oligonucleotide sequence SEQ ID 
No 1 by use of a random-label kit (Pharmacia). Five colonies 1-5 giving a positive hybridisation signal were 

55 grown up in L broth at 37°C for 20 hours on a small scale (100ml) and plasmid DNA prepared by centrifugation 
in a caesium chloride gradient essentially as described in "Molecular Cloning; A Laboratory Manual" by Mani- 
atas et a[ (Cold Spring Harbor). 

The DNA was sequenced by the standard dideoxy chain-termination method as described by Sanger et al 

15 
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TABLE 2 



10 


CODE 






PRIMING SITE 


15 


SEQ ID 


No 


19 


214-234 top strand 




SEQ ID 


No 


20 


333-353 top strand 




SEQ ID 


No 


21 


375-395 bottom strand 




SEQ ID 


No 


22 


207-227 bottom strand 


20 


SEQ ID 


No 


23 


69-93 bottom strand 



25 



30 



35 



40 



45 



SO 



55 



The plasmid DNA from clone 5 contained the DNA sequence shown in Fioure 10 Th* ni a « mi w h ■ 
nated pAG88 and was used to transform competent ce..s 1 the following E.cd.stain SifSE^So 
(hereinafter also referred to as MSD 522) by standard procedures. 00 

EXAMPLE 5 

Preparation of M13mp18 template containing [Ser^T, human q_ csf 

The procedure for steps a) and b) in Example 4 was repeated with the following modifications- 

4 «aSr d^d^c^ 26 ^ 27 ^ h — 0 — - 2. 3 and 

c) Cloning of the gene for [Ser ^ human g. csf jnt<) an expressjon 

(1) a synthet.c E. coli trp promoter and trp leader ribosome binding site 

(2) a translation initiation codon 

(4) a synthetic transcription termination sequence 
The DNA sequence of this region is shown in Figure 5a 

DNA (1ng) in water (16uJ) 
10X T4 polymerase buffer (2\i\) 
0.33M Tris acetate pH7.9 
0.1M Magnesium acetate 
0.66M Potassium acetate 
5mM dithiothreitol 

1mg/ml bovine serum albumin (BSA PENTAX fraction V) 

2mM dNTP mixture (1 ^) 

T4 DNA polymerase (1^1; 2.5 units/^1 BCL) 

16 
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lowing the manufacturer's (Bio101 ? USA) recommended procedure. 

The synthetic gene was isolated from the pSTP1 vectors as follows. The vectors were digested with Seal 
and Sail (both from BCL) in 100mM Nad, 10mM MgCI 2 and 10mM Tris HCI (pH7.5). The 530 bp fragment was 
purified from a 0.7% agarose gel and isolated by use of Geneclean (trademark) following the manufacturer's 
5 (Bio101) recommended procedure. 

For ligation, a mixture of the Seal - Sail gene fragment (50ng) and the plCI0020 vector fragment (100ng) 
in 20uJ of a solution containing 50mM Tris HCI (pH7.6), 10mM MgCI 2 , 1mM ATP, 1mM DTT, 5% w/v PEG 8000 
and T4 DNA ligase (2 units; BRL) were incubated at 16°C for 20 hours. The resulting mixture was used to trans- 
form competent E. coli HB101 cells (as supplied by BRL) as described herein. Transformants were selected 
10 for by growth on L-agar plates containing 50ng/ml ampicillin and screened for the presence of the gene by col- 
ony hybridisation with a 32 P labelled probe (SEQ ID No 24) as described herein. Plasmid DNA was prepared 
from 6 positively hybridising colonies, purified by centrifugation in a caesium chloride gradient and the sequence 
confirmed by dideoxy sequencing as described herein. 

The plasmid containing this gene was designated pICI 1080. 

15 

d) Subcloning of an expression cassette containing a gene for [Ser 17 - 27 ]G-CSF into M13mp18. 

The following subcloning was effected to provide a starting point for preparation of the G-CSF derivatives 
detailed in Examples 3-8. 

20 Plasmid DNA from plCI1080 (purified by caesium chloride density centrifugation) was digested to comple- 

tion with EcoRI and Sail (BCL) according to the manufacturer's instructions. The small EcoRI-Sall fragment 
containing the trp promoter and [Ser 17 « 27 ]G-CSF gene was isolated from a 0.7% agarose gel by use of Genec- 
lean (trademark). This fragment was cloned into an EcoRI-Sall cut M13mp18 vector (DNA supplied by Amer- 
sham International; enzymes from BCL). The fragments were ligated together in 5x BRL ligation Buffer using 

25 BRL T4 DNA ligase (described previously). The ligation mix was used to transfect competent E. coli TG1 cells 
(made competent according to the calcium chloride method of Mandel and Higa described in Molecular Cloning 
- A Laboratory Manual - Maniatis et al Cold Spring Harbor). The transfected cells were suspended in TY top 
agar containing 2% X-Gal in DMF and 200ul log phase E. coli TG1 cells and were plated on 2x TY agar plates 
(TY top agar - 8g Bactotryptone, 5g Yeast Extract, 5g NaCI, 3.75g Bacto-agar in 500^1 sterile H 2 0; TY plates 

30 - 8g Bactotryptone, 5g Yeast-extract, 5g NaCI, 7.5g Bactoagar in 500 ml sterile H 2 0.) 

Four white plaques were picked into 4 x 2 ml 1% E. coli TG1 cells in TY broth (8g Bactotryptone, 5g Yeast 
extract, 5g NaCI in 500ml sterile H 2 0) aliquots and grown for 6 hours at 37°C. The 2ml cultures were split into 
0.5ml and 1.5ml aliquots. The bacteria were centrifuged out of solution in an Eppendorf, (trademark) microfuge 
and the supematents were transferred to sterile eppendorf (trademark) tubes. The 0.5ml aliquots were stored 

35 at -20°C as phage stocks. The 1 .5ml aliquots were used to prepare single stranded DNA following the method 
in the Amersham International M13 sequencing handbook (see below). These DNA samples were then sequ- 
enced using oligonucleotides SEQ 1D No 22, SEQ 1D No 23 and M1 3 Universal sequencing primer. The reac- 
tions were carried out using the Sequenase kit (trademark) according to the manufacturers instructions. All 4 
clones had the correct DNA sequence for [Ser 17 » 27 ]G-CSF. 

40 

Large-scale single stranded DNA preparation 

For single stranded DNA preparations of between 200-500|ig of DNA/ml, the method in the Amersham In- 
ternational "Oligonucleotide Directed Mutagenesis" was used. A detailed procedure is carried out as follows:- 

45 

LARGE - SCALE SINGLE STRANDED DNA PREP: 

A. Preparation of 1ml phage stock 

so 1 . Pick a single TG1 E.coli colony from a glucose/minimal medium plate. Grow overnight in 10ml 2 x TY 

medium, shaken at 37°C. Add 10uJ to 20ml of fresh medium, and shake at 37°C for 3 hours. 

2. Inoculate 1ml 2 x TY medium in a 10ml sterile culture tube with 100uJ of 3 hour culture from step 1. 

3. Inoculate the 1ml culture with a recombinant plaque. 

4. Incubate for 4 hours with shaking at 37°C. Transfer to a microcentrifuge tube. 

55 5. Centrifuge for 5 minutes at ambient temperature. Pour supernatent into a fresh tube. 

Store overnight at 4°C. Set up an overnight culture of TG1 E.coli for the next stage. 



17 



BNSDOCID: <EP 0502637A2_L> 



10 



15 



EP 0 502 637 A2 

B. Growth of 100ml phage culture. 

1 . Moculate 100ml 2 x TY medium with 1ml of overnight TG1 culture and shake at 37°C to an O.D 500 of 

2. Add the 1ml phage supernatent from A5 (above) to the 100ml culture 

3. Incubate for 5 hours with shaking at 37°C. Transfer to centrifuge tubes 

4. Centrifuge at 5000 x g for 30 minutes at 4°C 

p™ F s D^ZLT an M,ms ° Tak * ——->'-'-» — *— 
s to *d"1 ST"^ 20 * "* PEG 6000 " M NaCI * SUMmaBnt Mh « » 

7. Centrifuge at 5000 x g for 20 minutes at 4°C. Dscard supernatent 

pipette^" 96 ^ 5000 X 9 5 minUteS ' rem ° Ve a " remainin9 PEG/NaC ' with a drawn out Pasteur 

?0 R S2 e nlf he f 31 Pe !' et '' n 5 ° 0lil Wat6r (d ° Ub,e diStiMed) and transfer to a microcentrifuge tube (1 .5ml) 
l^T^ZT^e 3 mi ™^ to «* Transferee supeLtei 

for^utes 2 ^ 

12. Centrifuge for 5 minutes, discard supernatent 

1 1 4 f min . Ut6 n- P ar ^" y a " ^ ° f PEG/NaCI with 9 drawn out P-tour pipette. 

14. Resuspend the viral pellet in 500u1 double distilled water. 

15. Add 200^1 phenol saturated with 10mM Tris HCI pHS.O, 1mM EDTA. Vortex briefly 

16. Stand tube for 15 minutes at room temperature. 
25 1 7. Centrifuge for 3 minutes. 

1 8. Transfer supernatent to fresh tube. 

19. Repeat steps 15-18. 

20. Add 500ui chloroform and extract aqueous phase twice. 

21. Add 50^ 3M sodium acetate and 1ml absolute ethanol. Mix. 
30 22. Place in a dry ice and ethanol bath for 20 minutes. 

23. Centrifuge for 15 minutes. 

24. Wash each pellet with 1ml -20°C ethanol. Pour off 

ded V DNA Um ^ Pe ' ,et faiSe 10 d ° Ub,e diStN,ed W8ter TWS P rocedure y'^s 1 0u-200ng single stran- 
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EXAMPLE 6 



Preparation of pICI 1 1 07 

The procedure described in Example 5 was repeated except as follows-- 
H buS ^WtSZESSSZ* 74 POlynUCle ° tide ki " aSe a " d diaested with Mst., (10 units) in 1 X 

oe. r F 0 °nS n „ 9 P 7 ; e / ipitati ° n With ethano1 ' the 143 b P EcoRI-Mstl. fragment was purified on a 1 0% polyacrylamide 
fn LxXle 4 Urea ' " e,eCtr0eluti0n * 9« s " ce »• DNA strands annealed as descTbed 

The synthetic EcoRI-Mstll fragment described above was cloned into the plasmid vector pAG88 described 
BCLTof t o pr ! parati0n ' P AG88 < 10 «» was digested with Mst| P (20 units; BCL^fl X H buffer 

BCL 100 m) for 2 hours at 37°C. The DNA was precipitated with ethanol from 0.3 M sodium acetate at 20°C 
then digested w,th EcoR. (20 units; BCL) in 1 X H buffer (BCL; 1 00 ul) for 2 hours at 37°C. Follow^ pSdpiSon 
w,t .ethanol, the large EcoRI-Mstll fragment was purified on a 1% agarose gel and purified 

" eSCnDed b l manufacturer < Bi ° 101. USA). Colonies containing thT^n^SSSS 
seSS 21" radi ° aCtK,e Pr ° be PrSPared fr ° m °' i90nuC le ° tide < SEQ 10 No D and'the co"S 

55 EXAMPLE 7 

Preparation of plasmid pICI 1239 

The site-directed mutagenesis procedure described below was employed using the mutagenic template 
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M13mp18 containing the gene for [Ser 1727 ]G-CSF described in Example 5 or 6 (above). The mutagenic 
oligonucleotides used are designated SEQ 1D No 28 and SEQ 1D No 29 (as hereinafter defined). 

The triplet ACG in SEQ 1D No 28 serves to convert Gin at position 1 1 to Arg and the first and last AGA 
triplets in SEQ ID No 29 serve to convert Pro at positions 65 and 60 to Ser. The mutagenesis was carried out 
5 as described below using SEQ ID No 29 in a single priming mutagenesis. This yielded a single plaque which 
incorporated the Pro 60 Ser and Pro 65 Ser changes. Single stranded DNA was prepared from this plaque as 
described in the mutagenesis procedure described below . This DNA was used as a mutagenic template in a 
single priming mutagenesis using SEQ ID No 28 as mutagenic primer. This yielded >100 plaques, 3 of which 
were screened by DNA sequencing as previously described. All 3 had the full set of changes incorporated. Dou- 
10 ble - stranded RF DNA was prepared from one of the plaques by following the procedure for large scale pre- 
paration of single stranded DNA (step d in Example 5) to step B5. The RF DNA was extracted from the bacterial 
pellet by the alkali lysis procedure of Bimboim and Doly (Nucleic Acids Research (1979) 7, 1513-1523) and 
purified by caesium chloride density gradient centrifugation as described in "Molecular Cloning - a Laboratory 
Manual" by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication). The purified RF DNA was diges- 
ts ted with EcoRI and Sail in buffer H as described previously and the 619bp fragment, containing the trp promoter, 
ribosome binding site, translation initiation codon and gene for [Ser 1727 ]G-CSF isolated from a 0.7% agarose 
gel by use of Geneclean (TM). The fragment was ligated into an EcoRI-Sall digested plCI0020 vector, using 
a 2:1 molar excess of insert to vector, with T4 DNA ligase (BRL) and ligase buffer, essentially as described 
previously. The ligation mix was used to transform E.Coli strain HB101. Transfonmants were selected for by 
20 growth on L-agar plates containing SO^g/ml ampicillin. Colonies were screened for the presence of the inserted 
DNA by restriction analysis of plasmid DNA prepared by the method of Birnboim and Doly as described in 
"Molecular Cloning - a Laboratory Manual" Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication). 
Plasmid DNA from a colony containing the expected 619bp EcoRI - Sail 
insert was used to transform E.coli strain MSD522 and designated plCI1239. 
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Site-directed mutagenesis protocol 



The phosphorothioate method of Eckstein and co-workers was used: 

Taylor, J W et al Nucleic Acids Research (1985) Vol pp 8749-8764 
30 Taylor, J W et al Nucleic Acids Research (1 985) Vol pp 8765-8785 

Nakamaye, K et al Nucleic Acids Research (1986) Vol pp 9679-9698 
Sayers, J R et al Nucleic Acids Research (1988) Vol pp 791-802 

The procedure can be carried out using a kit supplied by Amersham International. The method is outlined 
35 below and incorporates changes to the original method with regard to the use of more than one mutagenic 
oligonucleotide and the incubation temperature for oligonucleotides of greater than 30 bases in length. 

1. Annealing mutant oligonucleotide to single stranded DNA template: 

Single stranded DNA template (lug/ul) 5ul 

Phosporylated mutagenic oligonculeotide ( 1 . 6pmol/lul) 2.5ul 

Buffer 1 3.5ul 

45 Water 6ul 

(Where two mutagenic oligonucleotides were used simultaneously, 2.5\i\ (1 .6pmole/1 nl) of each phosporylated 
oligonucleotide was added to 5\x\ single stranded DNA template (1|ig/|jJ) in 3.5uJ Buffer 1 and 3.5u>l water. Where 
3 mutagenic oligonucleotides were used 2.5|il (1.6pmol/nl) of each phosporylated oligonucleotide was added 
so to 5jxl single stranded DNA (1u,g/uJ in 3.5^1 Buffer 1 and 1jil water). The above ingredients were placed in a 
capped tube in a 70°C water bath for 3 minutes if the oligonucleotide was <30bases in length or in a boiling 
water bath for 3 minutes if the oligonucleotide was > 30 bases in length. The tube was then placed in a 37°C 
water bath for 30 minutes. 

55 2. Synthesis and ligation of mutant DNA strand: 

To the annealing reaction were added 
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MgCI 2 solution 5^1 
Nucleotide mix 1 19^1 
(contains dCTP alpha S) 
water 6 ^il 
Klenow fragment (6 units) 1 .5^| 
T4 DNA ligase (5 units) 2^1 

The above ingredients were placed in a 16°C water-bath and left overnight. 

3. Removal of single stranded (non-mutant) DNA using disposable centrifugal filter units. 

To the reaction from Step 2 the following ingredients were added- 
Water 170uJ 
5MNaCI 30nl 

100 r^fZ mM NaC ' 3dded r6SPUn f ° r 10 minutes to wash throu 9 h «V ^maining RF DNA 
The following ingredients were added to the filtrate:- 

20 3M Sodium Acetate (pH6.0) 28fil 
Cold Ethanol (-20°C) 700jtl 

The mixture was placed in a dry ice and ethanol bath for 20 minutes and centrifuged in an Eooendorf micro- 
fuge for 1 5 minutes. The pellet was then resuspended in 10nl buffer 2. Eppendorf micro _ 

4. Nicking of the non-mutant strand using Nci I. 

,n , ™Hlt aaded " W ^ 3 8 Un,B NC ' ' (1 * The ~ 

30 5. Digestion of non-mutant strand using exonuclease III 

To the reaction mix from step 4 was added 
500mMNaCI 12^1 
Buffer 4 10|Ll | 
35 Exonuclease III (50units) 2ul 

ill wiit^H" m '^ ure was placed in a 37°C water bath and incubated for 30 minutes at 37°C, 50 units of exonuclease 

40 6. Repolymerisation and ligation of the gapped DNA. 

To the reaction mix from step 5 was added 
nucleotide mix 2 -| 3^1 

MgCI 2 solution 5^1 
45 DNA polymerase I (4 units) 1^| 
T4 DNA ligase (2.5 units) iuj 

The mixture was placed in a 16°C bath for 3 hours. 
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7. Transformation of competent host E. coli TG1 cells with the DNA: 

wereTr^^ Ce " S (Prepared fol,owin 9 the ™*»d of Mande, and Higa) 

were transformed with 20-al of the reaction mix from step 6 (in duplicate) 

ovin'ghf *™7?C. ^ ' ^ ° f '° 9 Ph3Se TG1 Ce " S ta ^ T ° P ™ ™« -abated 

USA T and E from IT ^ I *T* y aM]e ^ for exam P ,e the E. coli Genetic Stock Centre, Yale University 
HP^QMA P„ 1 A ™ rSham nternat,onal P |C - Amersham-Place, Little Chalfont, Amersham, Buckin^amS 
RPN 1523) 9 aSSUPP " ed " their "^^" mutagenesis system, oligonucleotide directed i^SSSS 
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EXAMPLE 8 



Preparation of plasmid pICt 1295 (also referred to as pCG300 ) 



(a) Production of pCG54 from plCH079 



plCI1079 is an ampicillin resistant, pAT1 53-derived plasmid containing the following elements between the 
EcoRI and Sty 1 1 restriction sites:- 

(i) a CI857 from phage A.; 

(ii) a XP L promoter; 

(iii) a synthetic ribosome binding site; 

(iv) a synthetic interferon a 2 gene sequence; 

(v) a synthetic transcription terminator sequence, derived from phage T4, between the Sail and Styl res- 
triction sites. 

The DNA sequence of this transcription terminator is shown in Figure 1(b) and SEQ ID No. 37. 
plCI1079 is illustrated in Figure 6. 

plCI1079 has been deposited under the Budapest Treaty, at the National Collections of Industrial and Marine 
Bacteria Limited (NCIMB), 23 St. Machar Drive, Aberdeen, AB2 1RY, Scotland, UK. (NCIMB No 40370, date 
of deposit 19 February 1991). 

pCG54 was constructed in order to make available an expression vector containing the same promoter, ribo- 
some binding site and transcription terminator sequences as above, ie: A,p L , RBS7 and T4, but lacking gene 
sequence encoding for production of a specific protein. Such a construct would provide the facility of a basic 
expression vector containing essential elements allowing transcription and translation for production of any pro- 
tein of interest which could be introduced into this vector by subsequent cloning events. 

Construction of the vector was initiated by restriction endonuclease cleavage of plCI1079 at its respective 
EcoRI and Sail sites. This cleavage step released a vector fragment containing the plCH079 backbone com- 
plete with genes for plasmid replication and antibiotic resistance functions, plus the T4 transcription terminator 
sequence. The fragment was isolated by agarose gel purification steps using Geneclean for final purification 
of the DNA fragment. 

To this vector fragment a second smaller DNA fragment of approximately 1 .2Kb in size was introduced. 
This second fragment may be obtained, for example by DNA synthesis or by site directed or PCR mutagenesis 
of the small EcoRI-Sall restriction fragment obtained from plCI1079 as described above. This second fragment 
contained exactly equivalent promoter and ribosome binding site sequences as originally present in plCI1079 
and additionally had EcoRI and Sail sites available at its 5' and 3' termini respectively, so providing compatible 
termini for ligation to the plCI1079 fragment. A ligation reaction in the presence of Gibco-BRL enzyme T4 DNA 
ligase and its respective buffer, resulted in the formation of the construct pCG54. 

Clones containing this construct were originally isolated following transformation of an aliquot of the ligation 
reaction mixture into E.coli competent cells of strain HB101. 

The construct pCG54 recovered was 3.682Kb in size and contained essential features as outlined on the 
map featured in Figure 1 1 . 

(b) Production of pCG61 from pCG54 (also referred to as plCI54) 

Synthetic oligonucleotide sequences were designed so as to include both the natural sequence for the 
T7A3 promoter and also a sequence which would provide an effective translation initiation region to enable 
correct processing of any polypeptide gene sequence cloned adjacent to it. A suitable candidate sequence for 
this latter region was identified as RBS1, the trp ribosome binding sequence. Therefore two complimentary 
oligonucleotides identified as SEQ ID No.38 and SEQ ID No. 39 were synthesized to generate a double stranded 
DNA linker incorporating the T7A3 promoter and RBS1 sequences . 

Oligonucleotides were prepared as 84mers by the standard protocol using an ABI gene synthesizer. They 
were designed so that in the double stranded form the synthetic fragments would have restriction endonuclease 
sites EcoRI and Kpnl at the 5' and 3' ends respectively. Due to their length the oligomers could not be purified 
by means of HPLC and purification was undertaken by means of acrylamide gel electrophoresis using a 10% 
acrylamide: 7M Urea gel. 

Prior to purification, the oligomers were first checked on a sizing gel to ensure that not only are they of the 
correct size but that also the samples prepared contain as their greatest proportion the oligomers required and 
not a high contaminating proportion of smaller secondary oligonucleotides which result as by-products of synth- 
esis. 
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EXAMPLE 9 - PREPARATION OF RICIN A 

55 9.1 Synthetic oligonucleotides 
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5'-dimethoxytrityl base-protected nucleoside-2-cyanoethyl-N,N- 

diisopropylphosphoramidites and protected nucleosides linked to controlled-pore glass supports on a 0.2 micro 
mol scale, according to protocols supplied by Applied Biosystems Inc. 

Each oligonucleotide, after cleavage from the solid support and removal of all protecting groups, was dis- 
solved in water (1ml) and a measurement of absorbance at 260nm used to determine concentration. 



9.2 Enzymes 

A variety of restriction endonucleases and DNA modifying enzymes were used in the manipulations des- 
10 cribed below. These were purchased from one of a number of suppliers (Amersham International, Bethesda 
Research Laboratories, Boehringer Mannheim or New England Biolabs) and used in accordance with the man- 
ufacturers instructions with respect to reaction conditions. 
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9.3 Construction of the pICI exprerssion vectors 
9.3 a) PICIQQ20 



As mentioned in Example 5(c), plasmid vector plCI0020 is a pAT153 based plasmid in which the 651 bp 
EcoRI-AccI region is replaced by a 167 bp EcoRI - Clal fragment consisting of:- 
20 (1) a synthetic E. coli trp promoter and trp leader ribosome binding site 

(2) a translation initiation codon 

(3) a multiple restriction enzyme recognition sequence derived from M13mp18, containing sites for Kpnl, 
BamHI, Xbal, Sail, Pstl, Sphl and Hindlll 

(4) a synthetic transcription termination sequence 

25 The construction of a plasmid vector containing a synthetic trp promoter sequence is published (Windass 

et a[ NucAcids Res. 10 p6639-6657, 1 982). A promoter fragment was isolated from such a vector after digestion 
with the enzymes Eco RI and Hpa l and purification of the appropriate band from an agarose gel by electro-elution 
(in "Molecular Cloning - A Laboratory Manual", Maniatis, Fritsch and Sambrook, published by CSH laboratory, 
second edition 1989 and hereinafter referred to as "Maniatis"). 

30 A pair of complementary synthetic oligonucleotides were prepared which would ligate to the Hpa l end of 

the promoter fragment providing the natural trjD leader ribosome binding site, a translation initiation codon and 
a 3' Kpn l cloning site. These oligonuleotides were mixed in equimolar concentrations and allowed to anneal 
by heating to 100°C followed by slowly cooling to room temperature. 

The promoter fragment and annealed oligonucleotides were then ligated and the appropriate band isolated 

35 from a polyacrylamide gel by electroelution. This fragment was then ligated with an M1 3mp18 vector derivative 
containing the trp. attenuator sequence (generated from synthetic oligonucleotides) cloned into the Hindlll site 
and introducing an additional Clal restriction site 3' to the attenuator. The ligated DNA was transfected into E.coli 
strain JM109 (Yanisch-Perron et al Gene, 33, p103, 1985) made competent by the CaCI 2 method (Maniatis, 
chapter 1p82). After plating out and incubation of the plates, plaques were screened by the method of Benton 

40 and Davies (Maniatis, chapter 4p41) using a 32 P labelled probe generated by nick translation of the EcoRI- Hpa l 
promoter fragment isolated previously. Single stranded DNA was prepared from positively hybridising plaques 
by a standard method (Maniatis, chapter 4p29) and sequenced using the M13 universal primer and the Sanger 
dideoxy chain termination method as provided in kit form by a number of suppliers eg. Sequenase (United 
States Bioscience). 

45 RF DNA was prepared from one isolate in which the promoter/ribosome binding site/attenuator sequence 

had been confirmed. This DNA was digested with Eco RI and Clal and the appropriate fragment isolated from 
a polyacrylamide gel as above. Plasmid pAT153 was digested with the enzymes Eco RI and Accl and ligated 
with the isolated promoter fragment. Ligated DNA was used to transform competent E.coli HB101 (Bethesda 
Research Laboratories) and ampicillin resistant colonies selected. 

so Plasmid DNA from several clones was prepared and DNA sequence derived from the region between the 

EcoR I and Cla l sites. One clone confirmed as containing the correct promoter/attenuator region was named 
PICI0020. 

This construction is outlined in Fig. 12. 



55 9.3 b) PICI1079 



As mentioned in Example 8(a), plasmid vector plCI1079 is an ampicillin resistant, pAT153-derived plasmid 
containing the following elements between the EcoRI and Styl restriction sites:- 
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(i) a CI857 gene from phage X; 

(ii) a AP L promoter; 

(iii) a synthetic ribosome binding site; 

(iv) a synthetic interferon a2 gene sequence; 

5 Mrthn^5r n n fi S PHOn termi " at0r d^ived from phage T4, between the Sail and Styl res- 

frLteTin Figure 6 S6quence of this inscription terminator is shown in Figure 1b. plCI1079 is illus- 

SSZr n? "J" 08 *? UndSr the BUdap6St Tfeaty - The deposit has been made * the NCIMB, 23 St 
10 40sT ' ■ ^ ° f dSPOSit W3S 19 Febmar y 1991 and numb ^ '3 NCIMB 

This plasmid was used to provide a source of the T4 transcription terminator for the generation of the ricin 

SSSTSatS-? M . 185 ST be,OW) " ^ Startin9 »** f ° r thS 9— tionof this p"asm d ™ 
Sn^\p P , 3Sm,d ^ ° n P,C,002 ° (See 9 3 3 above > in which an expression cassette con- 

» Sen ^ ^ ^ * * ^ * P ™ 5 ' 1983 > " — 

fle n^Th!T nta T ^"^ ™ oli 9° nuc,eotides was synthesised to generate the transcription terminator from 
gene 32 of bactenophage T4 with 5' Sail and 3'SpJil cohesive ends. This fragment was ligated with a plasmS 
fragment isolated from p.CM043 which had been digested to completion with Sal. and Sp^l The^ ntermS 
plasmid thus produced (p.C,1078) contained both the T4 terminator and trp. a^nuator^uencer ntenTrrf 
Asecondpa-rofcomplemetaryoligonudeotideswasthenusedtorepla^ethetoattenuatorseque^^ 

A T reSiStanCe 96ne) by inSerti ° n b6tWee " the ^ a " d of p CM078 

A unique BamHI site was introduced within this synthetic fragment. 
These manipulations are outlined in Fig. 1 3. 

25 9.4 Generation of a ricin A expressing clone 

9 -4 a) Preparation of pUC8RA plasmid DNA 

f C, ° ne < pUC8RA > was generated which contains the DNA coding for ricin A. This clone contains A-chain 
857?a^ 

5? a 265 27m ?n !t ' " " ^ (UmW - F " Roberts ' LM -. ^r6,JM. Eur.J.Biochem , 1985 

14§, p265-270) m plasmid pUCS (V.e.ra.J and Messing.J. Gene, 19, p259, 1982). In addition site-d rected 
mutogenesis has been used to generate a translation termination c^don immediately 3' to the finafcodon ot 

tncLeTin f R S T.T " 0 ' H r- M ^ FEBS L6ttS ' 1 987 ' 2& p73 - 78 >" The enti ' e A-chain cod ng 'egion 
is included m a Bam HI fragment from this clone. 

of thtZl! qUantity SV^** Pl3Smid ° NA W3S ° btained the «H>i"atDrs. For future stocks, a dilution 
of this DNA was used to transform into Exoli DH5a competent cells (Bethesda Research Laboratories) an2 
an amfHcilhn resistant transformant selected. Plasmid DNA from this clone was prepared by 3EJbE 
lt^n^ P T/ ManiatiS ' Ch3Pter 1 P25) - Samp,6S ° f this DNA were di 9ested with BamHI and sZlseZ- 

to be Sentir 63 ^ ™* Md> 0 " Wb baSiS ' the ^ DN Asam P .es were assumed 

9.4 b) Sub-cloning into M13 

/AnnTTV dig t StS ( ° f PUC8RA plasmid DNA a "d RF (replicative form) DNA from the phage M13 strain K19 
£2Z B,0teChn0, ° 9 J ) We ? " shotaun * !i 9 ated u ^ng standard conditions (Maniatis, chapter 1p68) Con Jol 
l.gabons were also performed. The ligated DNAs were used to transform E.coli strain TG1 (Gibson 1984/C 
■■an) made competent by the CaC1 2 method (Maniatis, chapter 1p82) i»»4/Ang 

nh J*™.! ^ Pla T * T PlCked ** * ln * e slran<l DNA P«P>™«on. Direct gel electrophoresis of lysed 

seo^cinaT^h. "J* 0 *! "7 ""^ C<>n,aini " , ' steal>te insM «~ ""firmed oy 
sequencing to be the nan A-cham coding sequence. Only 182 bases of Che mature ricin A coding sequence 
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9.4 c) Mutagenesis of M1 3K1 9RA 

To generate a Kpn l site, compatible with pICI expression vectors, at the start of mature ricin A, the following 
changes (underlined) are necessary:- 

SEQ.ID.NO. 44 



5 ' GAT AACAACATATTCCCCAAA 3 ' 

.Ricin leader sequence . ...| Mature ricin A > 



Changed to: 
SEQ.ID.No.45 



20 5 ' GATAACAACATGGTACCCAAA 3 ' 

| Kpn l 
I 

25 Translation initiation 

and result in an ATG codon overlapping a Kpn l site. A Kpn l fragment containing ricin A can be excised from 
the mutant and inserted into the ICI expression vector series . Two N-terminal amino acid modifications are 
made ( ile-phe to met-val ). 

30 The single stranded DNA prepared from M13K19RA was the template for the mutagenesis step for each 

mutation strategy. A single oligonucleotide (DTR16) introducing all the mutational changes for this strategy was 
synthesised. 

DTR16 5' AACAACATGGTACCCAAACAA 3' SEQ.ID.NO. 46 

35 

Several protocols exist for the introduction of specific DNA sequence changes by site directed 
mutagenesis. The procedures outlined below were achieved using the method of Eckstein etal (Nuc. Acid Res., 
1985, 13 p8749-8764 and 1986, 14, p9679-9698) as provided in kit form (Amersham International) and used 
in accordance with the manufacturers instructions. 

40 The principle of this method is to prime the single-stranded DNA template with the mutagenic oligonuc- 

leotide and synthesise the complementary strand incorporating dATPaS in place of dATP. Using this nucleotide 
results in the formation of phosphorothioate bonds which are not cleaved by certain restriction enzymes (eg. 
Neil). After synthesis of the second strand, Nei l is used to nick the parent strand and exonuclease III added to 
digest back past the mutation point. DNA polymerase I then allows resynthesis of the parent strand. Conse- 

45 quently, the mutagenic oligonucleotide acts as a template for resynthesis and the mutation is introduced into 
both strands prior to transformation. Mutation frequencies up to 96% of the total progeny are claimed and 
screening is simply performed by picking plaques at random for sequence analysis. 
In our experiments 4 out of 4 plaques picked were correctly mutated. 

Having chosen one mutant (MRA16), RF DNA was prepared and checked for the presence of the newly 
so generated restriction fragment ie Kpn l. 

9.4.d) Cloning, Expression and Initial Characterisation 

The pICI series of expression vectors (see section 5) can accept DNA fragments cloned into a unique Kpn l 
55 restriction site adjacent to the Trrj promoter. The Kpn l site overlaps the translation initiation codon (ATG) which 
is situated 8bp downstream from the Shine-Dalgamo site (AGGA) of the promoter. 

Having verified the sequence of MRA16, a large scale (~5ng RF DNA) Kpn l digest was performed and the 
relevant ricin A coding DNA fragment isolated from an agarose gel (Nu-Sieve GTG agarose, FMC Bio-products) 

25 
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by phenol extraction of an excised gel slice according to the manufacturer's protocol 

PICI0020 (see 9.3a) was digested with Kpnl and then dephosphorylated using calf intestinal alkaline phos- 
phatase (CIP- Boehrmger Mannheim). The latter treatment prevents recircularisation of the vector upon ligation 
which would lead to a high proportion of parentals in the transformation progeny 
s Ligations were set up with ratios of plasmid vector to isolated fragment from 8:1 (w/w) to 1:3 for the various 

strategies^ Control ligations to test the effectiveness of phosphatase treatment, ligase activity etc , were 
included The ligation conditions were appropriate for the source of T4 DNA ligase used (New England Biolabs 
orAmersham). Reactions were generally incubated at 15°C overnight. 

,„ - *f S™* 6 "! ™ 3Ch ,i9ati0n (5M) reaCti ° n Was diluted to 100 ^ with 1 x ™ E (50mM Tris, 50mM NaC1 

ITJ wi? T ° fC °TT — 0841 ° 3dded - After 3 Standar * information protocol (Maniatis, 
chapter 1 p74), the cells were plated onto L agar plus streptomycin (25ug/ml) and ampicillin (100ug/ml) and incu- 
bated at 37°C overnight. ExoM DS41 0 has a chromosomal streptomycin resistant gene 

The transformation plates were examined after incubation. In general, 5 to 10 times more colonies were 
seen in ligations compared to controls without ligase. In some cases, little difference in the number of colonies 
is produced in the presence or absence of ligase occurred indicating incomplete digestion of the vector or poor 
ligase activity. K 

. . Tran sfo m ants. Plus the relevant controls were picked onto nitrocellulose filters placed on L agar plates for 

!na«? ,S A a i'° n SCr ! e ? in9 (b3Sed ° n the meth ° d ° f Grunstein and Hogness as described in Maniatis, chapter 
™ l-,£7L 1 ,ncubat,on ' the colonies were 'ysed insjtu using 1 0% SDS and 1 M NaOH, neutralised using 1 M 
20 I ns (pH 7.5) and dried under vacuum at 80°C for 2 hours. 

Hybridisation probes were generated by ™p labelling of the mutational oligonucleotides using T4 polynuc- 
leotide kmase. The filters were probed at room temperature and then washed in stages up to 55-65°C to remove 
non-spec.fica ly bound counts before autoradiography. Specific hybridisation indicated putative clones contain- 
ing ncin A DNA. 

Small scale DNA preparations (by the methods of Holmes and Quigley or Birnboim-Doly as specified in 
Maniatis, chapter 1p25) were made from positively hybridising clones. The DNAs were digested with the relev- 
ant restnction enzymes eg Kpnl and EcoRI/Bglll, and analysed by electrophoresis on agarose gels. Vector 
DNAs and mutated RF DNAs were cut with the same enzymes to demonstrate the fragment sizes expected 
for the correct clones. 

Larger scale plasmid DNA preparations (Birnboim-Doly) of each clone were used for more detailed res- 
triction ana ys.s, eg. Oal, Hindlll, BamHI, EcqRI/Bglll Kpnl, and Seal. On agarose gels, these digests showed 
the size of fragment inserted, an indication of its orientation and the gain of some unique ricin A-chain enzyme 
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35 9.4.e) Expression studies 
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a ™ rr . y ! 6d by n y bridisation and restriction screening were tested for expression of ricin 
A by SDS-PAGE analysis of total cell lysates. The standard conditions for expression studies were - 

1) Inoculate 10ml of L-broth + antibiotic(s) with a single colony and grow at 37-C overnight with gentle shak- 

40 ing. 

2) Take 750ul of the L-broth overnight and pellet the cells in a microfuge (1 min at 6500 rpm) 

3) Resuspend pellet in 300 M I M9 medium (Maniatis, appendix A.3) + 0.02% casein hydrolysate + 0 2% glu- 
cose + 50ng/ml thiamine and inoculate into 1 0ml of same. 

4) Incubate for 7 hours or overnight at 37°C with gentle shaking 

h!r e /M nCUb ? i0n K m ! aS, i? ° D540 ' Pe " et th6 Ce " S and resus P end t° 0 D640 = 10 per ml in Laemmli sample 
buffer (Maniatis, chapter 1 8p53). Boil for 1 5 minutes. 

6) Load 20^1 of total cell lysate on an SDS polyacrylamide gel, electrophorese, stain with Coomassie blue 
destain and visualise. 

Of the clones studied by SDS-PAGE, only 1 showed an additional band with 
equivalent molecular weight of -29KD (equivalent to that estimated for unglycosylated, mature ricin A). Gel 
pSlTl02 expression level to be in the range of 5-10% of total cell protein. This clone was named 

^ The construction of P ICI1 102 is outlined in Fig.14. Results of expression studies are shown in Figs.15 and 
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9 4 0 Western transfers and immunodetection of recombinant ricin A 



Authenticity of recombinant ricin A-chain protein, initially observed by Coomassie blue staining of SDS- 
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polyacrylamide gels, was confirmed by Western blotting. The protein bands were transferred to nitrocellulose 
filters and detected using a ricin A specific antibody followed by peroxidase labelled antiglobulins. 

1 5% SDS-PAGE gels were run overnight at 8mA then equilibrated for at least 30 minutes in transfer buffer. 

Protein bands on the gels were then transferred to nitrocellulose membranes (Hybond-C, Amersham) 
5 electrophoretically in a Bio-Rad Trans Blot apparatus at 70V for 3 hours. The filters could be stored, after drying, 
in sealed plastic bags at -20°C. 

Ricin A.1 was a polyclonal antibody raised in rabbits against a synthetic peptide fragment of ricin A. Pre- 
liminary studies showed good affinity for ricin A but considerable cross-reactivity with many E.coli proteins. To 
overcome the high background caused by this cross-reactivity the antibody was pre-incubated with an E.coli 
10 lysate. 

Thus, a 10ml L-broth overnight culture of E.coli strain DS410 was centrifuged at 4000 rpm for 10 minutes 
to pellet the cells. The pellet was resuspended in 5ml of bacterial buffer and sonicated at 4-6p, for 6 x 1 0 second 
bursts with 30 seconds cooling intervals on ice. 

0.5ml of sonicate was then mixed with 0.5ml of ricin A.1 antiserum and incubated at room temperature for 
15 90 minutes. Cell debris was spun down at 13000 rpm for 5 minutes and the supernate stored at ^20°C. 

The nitrocellulose filters from Western transfers were blocked by incubation overnight at room temperature 
in 5% BSA-PBS/Tween. (PBS Tween = 5ml Tween 20 per 1 litre of PBS). 
Washed 3x3 minutes in PBS/Tween. 

Incubated 2 hours (or overnight) at room temperature with a 1/4000 dilution of "blocked" Ricin A.1 antibody in 
20 0.5% BSA-PBS/Tween. 

Washed 3x3 minutes in PBS/Tween. 

Incubated 1 hour with a 1/1000 dilution of goat anti rabbit antiserum in 0.5% BSA-PBS/Tween at room tem- 
perature. 

Washed 3x3 minutes in PBS/Tween. 
25 Incubated 1 hour with a 1/5000 dilution of rabbit peroxidase anti-peroxidase antiserum in 0.5% BSA/PBS/Tween 
at room temperature. 
Washed 3x3 minutes in PBS/Tween. 

Developed by immersion in a solution of 4-chioronaphthol (60mg) in 20ml methanol made to 120ml with PBS 
and containing 12uJ hydrogen peroxide. The membrane was removed from the solution as soon as bands were 
30 visible, dried and photographed. 

A typical Western blot analysis is shown in Fig. 17. 



9.4 g) Biological assay for recombinant ricin A protein 

35 The aim here was to establish conditions under which samples generated during the ricin A-chain purifi- 

cation from E.coli cells could be tested for biological activity in a cell-free in vitro protein synthesis assay. 

Rabbit reticulocyte lysates were prepared according to the method of Allen and Schweet (J Biol Chem 
(1962), 237 , 760-767). The assay demonstrates inhibition of protein synthesis in a cell-free system by a lack 
of incorporation of 14 C-labelled leucine into newly synthesised protein. 

40 

9.4 g.i) The assay protocol 



Stock solution: 1mM amino acid mix minus leucine. 
A solution containing all L-amino acids at 1mM except leucine (adjusted to pH7.4 with NaOH and stored at - 
45 70°C). 

Soln. A 

40m M Magnesium acetate 
2M Ammonium acetate 
0.2M Tris 

50 (pH 7.4 with HC1 , stored 4°C) 

Soln. B 

ATP (Sigma A5394) 246mg/ml 
GTP (Sigma G8752) 24.4mg/ml 
Assay mix: 1ml Amino acid mixture 
55 1ml Soln. A 

0.1ml Soln. B 

103mg Creatine phosphate 
1mg Creatine kinase 
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510^1 H 2 0 

600^1 (60nCi) L- 14 C-leucine (New England Nuclear, NEC-279E) 
Reaction mix: Test sample 25p,l 
Assay mix 12.5^x1 
Rabbit reticulocyte lysate 25uJ 
Blank solution was 2mg/ml BSA in PBS 
All assays were done in duplicate 

. i l 2 '^ 1 f * Ssay mix P |aced in sterile 9 |as s tubes 25^1 of BSA in PBS added to each of first four tubes for 
blanks 25jJ of test samples added to rest of tubes 1ml 0.1 M KOH added to first two tubes (background blank) 
Tubes eqmhbrated to 28°C in a water bath 25ul of rabbit reticulocyte lysate (allowed to thaw from liquid nitrogen 

TfnZ^r* add6d t0 6aCh tUbe 3t 20 S6COnd interva,s - When first had '"Abated for 12 minutes 
1ml 0.1M KOH was added to each tube again at 20 second intervals to allow all tubes to have 12 minutes incu- 
bation. Two drops of 20% hydrogen peroxide were added to each tube followed by 1ml of 20% TCA 
Tubes were mixed and allowed to stand for at least 1 hour, or overnight, at 4°C. The precipitates were filtered 

SJLw qJC mp W3 f W J th i X 4 m ' ° f 5% TCA ' I™*™* to scintillation vials and 10ml scintilla* 

(Ready-Solv. MP, Beckman) added. After 1 hour the vials were shaken and counted. 

9 4 g.ip Establishment of technique for use with E.coli lysates 

1 0ml L-broth overnight cultures were grown at 37°C. 400^1 aliquots were pelleted at 1 3000 rpm for 30 sec- 
onds and most of the supernate decanted. 

The pellets were subjected to 2 rounds of rapid freezing in dry ice/EtOH followed by thawing at 37°C 12ul 
of 25 A sucrose in 50mM Tris HC1 pHS.O were added followed by 4»il of a 10mg/ml solution of lysozyme 

After.ncubat.cn on ice for 15 minutes, 8pJ of 0.25M EDTA were added and incubation continued for 15 
minutes Lysis was brought about osmotically by diluting the samples to 400 H I with water. This procedure pro- 
duced viable cell counts of 80-100 per ml. 

When a 25 M I aliquot of this lysate was added into the assay reaction mix, the level of incorporation of "C- 
eucme into newly ^synthesised protein was-10% of the blankwithoutlysate. This wasasimilarlevel of inhibition 
to that produced by 8ng/ml ricin A. Dilutions of the Exoli lysate were then prepared and the assay repeated 
The result c early ^showed that a minimum 16-fold dilution was necessary to reduce the effect of the lysate to 
equal that of the blank. 

In order to be as confident as possible that lysis of E^olj and ExoM lysates would not compromise ricin A 
toxicity, 2 control assays were performed. The first added plant-derived ricin A to a 16X diluted E.coli cell pellet 
so as to give a final concentration of 8ng/ml in the assay mix after cell lysis. Both these controlTshowed no 
deleterious affect from the lysates or the lysis procedure on the inhibitory action of ricin A 

These techniques were used to verify the synthesis of biologically active, recombinant ricin Afrom pICM 102 
and the clones described below. 

9.4 h) DNA sequence analysis 

Plasmid DNA sequencing was used to analyse plCI1102. The protocol chosen was modified from Zagursky 
§Lg (Gene Analysis Techniques Vol 2, N° 5) and involves alkaline denaturation of double stranded plasmid 
DNA prior to primer annealing and sequencing by a standard procedure such as that provided in kit form by 
several supphers, eg. Sequenase (United States Bioscience). By using an oligonucleotide to prime at the 3' 
gerIe 0 wts a Sbfe e ^ S6Veral A " Chai " intema ' PrimerS ' Sequencin9 both strands of the Pooler and ricin A 

The initial sequencing data revealed an unexpected result in that an additional Kpnl fragment was present 
between the promoter and ricin A coding sequence, ie: 
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SEQ.ID.NO. 47 



Kpn l 

5 ' AAAAAGGGTATCGACATGGTACCCGGGGATCCACCTCAGGGTGG 



Kpn l 



TCTTTCACATTAGAGGATAACAACATGGTACCCAAACAATAC 3 ' 



The additional Kpn l fragment has come from M13K19RA and contains restriction enzyme sites plus the 
part of the ricin leader sequence cioned from pUC8RA. The 5' region of the ricin A chain contains the base 
20 changes induced during mutagenesis. 

Study of this sequence reveals that the first translation initiation codon (ATG) is out of frame with that the 
ricin A coding region. Also, there is an in-frame termination codon (TAG) prior to the ricin A initiation codon 
and a putative Shine-Dalgarno sequence (AGGA) which could re-initiate translation from the second ATG. 
Subsequent studies revealed that, surprisingly, this additional DNA fragment conferred a beneficial advan- 
25 tage with respect to the accumulation level of ricin A-chain in E.coli when compared to clones from which it 
had been excised. 

The complete DNA sequence of the ricin A gene contained in plCI1102 is given in Fig. 18. 



9.5. Generation of subsequent ricin A expressing clones 
9.5 a) Mutation of Ricin-A clone plCI11Q2 to allow subcloning 



To subclone the two Kpn l fragments from the fortuitously generated pIC1 1 1 20 in the correct orientation for 
ricin-A expression would be difficult. Consequently, we planned to alter the internal Kpn l recognition site by a 
35 single base substitution (A to T). This would prevent Kpn l cleavage at this site and allow the subcloning of a 
single Kpn l fragment into the range of trp/RBS vectors. By substituting the adenine of the Kpn l recognition site 
(GGTACC) with thymine (ie GGTTCC) the first residue of ricin-A is unaltered (GTA/GTT = Val). 
ie: 

40 

Kpnl 53bp fragment Kpn l Ricin-A sequence Kpn l 

J I I 

GGTACC ATGGTACC | GGTACC 

45 TGA 
Changed to: 

Kpn l 53bp fragment Ricin-A sequence Kpn l 



GGTACC ATGGTTCC | GGTACC 

I TGA 
not recognised by Kpn l 
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The oligonucleotide synthesised to produce this change has the sequence: 

5' ATAACAACATGGTTCCCAAACAATAC 3' 
where the underlined base represents the mutational change- 

vJ^J^m t0 < S" e - ^ mUt3ted riCin " A fra9ment int ° 3 ran9e ° f fiE expression vectors for comparative 
express on stud.es. Cloning .nto P ICI0020 provides a comparison with pICI 1102 to determine the effects on 
expression, if any, of the single base substitution. 

9.5 b) Mutagenesis 

sent* »£? !S 'aC™^'* ^ " M13 C '° ne the *"° fra9ments P re " 

Zl Inl Pnlif muta 9 enesis - lsolat «s carrying the desired mutations were identified by random sam- 

specificaSy SeqU6nCe determination over the *> which the mutagenic oligonucleotide binds 

th„ ZTr mUtate A d te !T Plate ^ " amed MRA22 " ThiS WaS ana| y sed further ^ DNA sequence determination of 
the entire nc.n-A cod.ng sequence to verify the absence of non-specific mutations. 

20 9.5 c) Sub-cloning 

olanu^ Th-h* f "; 9,e - Stranded DNAs were used to transform competent ExoM TG1 cells to produce single 
plaques. Individual plaques were then picked and replicative form (RF, double-stranded) DNA purified by band! 

TTr! ethidium bromide buoyant density 9radients - The p»** o^T^Z 

£pnl cut and phosphatased express.on vector or by specific ligation of the ricin-A fragment after its purification 
from an agarose gel. Ligated DNA was transformed into Expli TG1 or HB101 Permeation 

h., «HK n ' A C ° nta ' ning clones were identified b y hybridisation screening using a labelled ricin-A probe pro- 

ulnn '« k™? , T" 9 h y bridisation were sc "*"ed further by restriction analysis of plasmid DNA 

and IcoS^ , 96St a " ^ R,/B ^" double di 9^- HP"' identifies the size of the inseL fragment 
and EcoRI/BglH determines the orientation of the fragment. 

to c.^ e l^ irmed ^ h !, Vin9 ! hS riCin " A fra9ment in thS COrrect orienta «°n ^ expression were subjected 
£^^2?SLT"! T a "! lySiS by SDS " PAGE fol,owed b y Coomassie staining and Western Wotting of 
duphcate ge s The level of nan A accumulation in these clones was equivalent to that detected from pICM 102 
One isolate was selected and named pIC1 1 1 31 . 

9 5 d ) Use of an alternative transcription terminator element . 

40 In these experiments, the trp. promoter and ricin-A fragment from pIC1 1 1 31 was excised by digestion with 

Ind thf n ¥^ ^ - ^ l3tter en2yme deaVeS b6tween the 3 ' t6rminus ° f the n^"- A oilnWJS 
and the trpA franscription terminator. The resulting fragment was excised from an agarose gel (2% NuStevI 

Tolton T' FM f C w B ^ producte ) and P" rified by Phenol and chloroform extractions followed by ethanof pZ 
45 the T 4 terminator between unique Sail and Spjil sites. 

J£h ted DNA W3S " sed to ^form competent E^ HB101 (BRL) and hybridisation screening used to 
S N 7 enCe °l nC ^ DNA 38 in PreVi ° US ex P erimen *- Positively hybridising clones were cSosen for 
plasm* DNA preparation followed by restriction analysis with EcoRI and Sail together to show the present 

of an appropriately sized fragment. — presence 

so One isolate with the correct construction was identified and named pICH 1 85. 

9 5 e > Generation inducible tetracycline selection vector 

The clone, pIC1 1 1 85 , was used to produce a further construct by subcloning the expression cassette into 

£a2Tta ' P ° 042 - P ' aSmid DNA W3S Pr6Pared fr ° m P,C ' 1185 and d ^ted with Eco^nS 
mS, r h f' Se an express,on cassette containing the trp. promote r/RBS1 /ricin-A <MRA22)7ra^ent/T 4 ie7- 

ISSi 1 was is ° ,ated by the method outlined in 94d and ligated witn plcl 0042 cut with law 
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Ligated DNA was used to transform Exoli HB101 . HB1 01 transformations were plated on L agar + tetracyc- 
line, and incubated at 37°C overnight and colonies screened by hybridisation with a 32 P labelled ricin-A DNA 
probe. 

In both cases, positively identified colonies were confirmed by restriction analysis of plasmid DNA using 
EcoRI/Sph l and EcoRI/Bgl ll digests. Three isolates were identified ie plCI1 187.1-3. 
Fig. 19 outlines the construction of plCI1185 and plCI1187. 



9.5 f) Clone selection 



10 The clones isolated were used to transform E.coli 71.18 and single colonies picked for clone selection 

studies. The resulting whole cell lysates were electrophoresed on duplicate SDS-PAGE gels, one of which was 
stained with Coomassie blue and the other used for Western blot analysis. 

The stained gel provided minimal data on ricin-A expression due to the presence of a co-migrating protein 
from E.coli 71 .18. Western blotting clearly indicated ricin-A expression in comparison to positive and negative 

15 control samples. One isolate was used in the fermentations described below. 

RICIN A CHAIN FERMENTATION 

Plasmid pICI 1187 was transformed into E. coli strain MSD68 and the resultant recombinant (MSD1051) 
20 purified and maintained on glycerol stocks at -80°C. 

An aliquot of the culture was removed from stock and streaked onto agar plates of L- tetracycline to separate 
single colonies after overnight growth at 37°C. A single colony of MSD 1051 was removed and resuspended 
in 10ml L- tetracycline broth and 100uJ immediately inoculated into each of 10 250ml Erhlenmeyer flasks con- 
taining 75ml L-tetracycline broth. After growth for 1 6 at 37°C on a reciprocating shaker the contents of the flasks 
25 were pooled and used to inoculate a fermenter containing 20L modified LCM50 growth medium. 

Composition of Modified LCM50 



Made up of distilled water 



™ 2 P0 4 3.0 

35 Na 2 HP0 4 6-0 

NaCl 0.5 

Casein hydrolysate (Oxoid LAI) 2.0 

^ (NH A ) 2 S0 4 10 .00 

Yeast Extract (Difco) 20.00 

Glycerol 35.00 

MgS0 4 . 7H 2 0 0.5 

K CaCl 2 . 2H 2 0 0.03 

Thiamine 0.008 

FeSO^VCitric Acid 0.94/0.02 

Trace element solution (TES) 0.5ml 



Fermentations were then carried out at a temperature of 37°C and pH, controlled by automatic addition of 
6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air-saturation 
and was controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fermenter, initially 
55 20LVmin, corresponding to 1 volume per volume per minute (WM) was increased to45L/min when the fermenter 
stirrer speed approached 80-90% of its maximum. 

Throughout the fermentation samples were taken for measurement of optical density (OD550), cell dry 
weight and accumulation of ricin A chain within the cells. Ricin A chain accumulation was measured by scanning 
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Coomassie blue stained SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the 
art. 

4 1 / 2 h after inoculation, yeast extract (Difco) solution (225g/L) was pumped into the fermenters at a rate of 
1.7g/lVh. 

5 After 12h when OD^ reached approximately 50, and before the fermentation became oxygen-limited bac- 

teria were harvested on a Sorval RC3B centrifuge (700g, 30 min, 4°) and accumulated protein recovered from 
the bacteria. 

NOTE: 

10 

E. coli DS410 (also referred to as MSD68 herein) is well known (Dougan and Sherratt, Molecular and Gen- 
eral Genetics, Vol 151, p151-160, 1977) and has the published genotype F~ ara azi ton A lac Y min A min B 
rpsLmalAxyJmtf thi. This strain is freely available to the public, and moreover was deposited by the Applicants 
on 7 June 1985, under the Budapest Treaty, with the National Collections Of Industrial & Marine Bacteria Ltd, 
15 Aberdeen, Scotland under deposition number 12100. 

The cells were collected from the fermentation broth using a continuous disc stack intermittent discharge 
separator. The broth (501 from 2 x 251 fermentation) was initially transferred from the fermenters to a 501 trundle 
tank and transported to a contained system consisting of a number of holding tanks connected to the separator 
and homogenizer. 

20 The trundle tank was connected to this system and the broth pumped through the centrifugal separator at 

a flow rate of 401/h. The discharge rate was adjusted so that the centrifuge supernatant was clear by visual 
inspection of an eyeglass in the supernatant discharge line. The supernatant was collected in a kill tank con- 
taining .201 of 0.1 M sodium hydroxide sanitizing solution prior to disposal. The cells were resuspended in 40I 
of Buffer A (50mM sodium dihydrogen orthophosphate, 25mM ethylene diamine tetra acetic acid, 5mM ben- 

25 zamidine, 2mM dithiothreitol, pH 6.3 with 5N sodium hydroxide) and prechilled to 8°C in the solids receiver ves- 
sel. The suspended cells were then transferred back to the trundle tank via the homogenizer adjusted to a 
working pressure of 600 bar. The resulting homogenate (60!) was chilled to <20°C and make 0.5% with respect 
to polythenemine by the addition of 2.5I of a 10% (v/v) solution. The suspension was allowed to flocculate for 
1 0 min before transfer to the Holding Tank via the centrifugal separator. The clear supernatant was then steri- 

30 lized by purifying through a depth filter and a positively charged 0.2|* membrane filter. 

The sterile clarified supernatant was concentrated to a volume of 121 using aspiral cartridge cross flow fil- 
tration device and the solution brought to 40% saturation by the addition of 2.9KG of solid ammonium sulphate 
crystals. The solution was allowed to flocculate by gentle stirring overnight at 15°C and then centrifuged using 
the continuous flow centrifuge. The discharged slurry was stored at 70°C until required for further processing. 

35 The ammonium sulphate precipitate was thawed in the presence of 141 of Buffer B (50mM sodium dihyd- 

rogen orthophosphate, 25mM elthylene diamine tetracetic acid, 2mM dithiothreitol, pH 6.3 with 5N sodium hyd- 
roxide). After 30min the suspension was clarified by centrifugation and desalted by diafiltration against 701 of 
Buffer B and the conductivity checked that it had been reduced to below 3MS/cm. The desalted solution was 
claified further by centrifugation and processed immediately. 

40 The desalted solution was slowly added to a batch chromatography tank containing 2kg of DEAE-cellulose 

which had been equilibrated with 601 of Buffer B. After stirring for 6.5h the unbound r-ricin solution was pumped 
from the bottom of the tank through an 1 1 .3cm diam x 1 0cm column of packed and equilibrated DEAE-cellulose 
at a flow rate of 80ml/min. The bulk of the r-ricin A did not bind and was collected in a stainless steel vessel. 
The r-ricin A solution was adjusted to pH 5.5 with 1 M orthophosphonic acid and applied to a 1 0cm diameter 

45 x 10cm column of car bosy methyl agaros equilibrated with 101 of Buffer C (25mM sodium dihydrogen orthophos- 
phate, 5mM ethylene diamine tetra acetic acid, 2mM dithiothreitol, pH 5.5 with 5N sodium hydroxide). The r-ricin 
A bound to this column and after washing with 1 01 of Buffer C was eluted with Buffer D (25mm sodium dihyd- 
rogen orthophosphate, 5mM ethyl diamine tetracetic acid, 2mM dithiothreitol, 100mM sodium chloride, pH 5.5 
with 5N sodium hydroxide). The pure r-ricin A eluted as a single peak which was collected and stored at 4°C 

so as a sterile solution until required for further processing. The r-ricin A is stable under these conditions for up 
to 2 months. 
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# 



SEQUENCE LISTING 



10 



15 



20 



25 



30 



35 



40 



45 



50 



The following is a list of the sequences referred to in the 
application. The sequences are written in the conventional 5' to 
3'sense. 

SEQ ID No 1 

SEQUENCE LENGTH: 62 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 47 
CTG CTG AAG TGT CTC 62 

SEQ ID No 2 

SEQUENCE LENGTH: 64 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

CTG TTC GAG ACA CTT CAG CAG GAA AGA CTG CGG CAG AGA GCT TGC 45 
TGG ACC CAG TGG AGT ACTG 64 

SEQ ID No 3 

SEQUENCE LENGTH: 60 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS : Single 

TOPOLOGY: Linear 

GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT CTG CAG GAA 45 

AAG CTG TGC GCA ACC 60 
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SEQ ID No 4 



5 



SEQUENCE LENGTH: 60 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS : S i ngle 

TOPOLOGY: Linear 



TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC ATC GCC 
TTG AAT TTT ACG TAC 

SEQ ID No 5 

SEQUENCE LENGTH: 48 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNE S S : S i ng 1 e 
TOPOLOGY: Linear 

TAC AAA CTG TGC CAC CCT GAG GAA CTG GTG CTG CTC GGT CAC TCT CTG 
SEQ ID No 6 

SEQUENCE LENGTH: 51 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CGG GAT CCC CAG AGA GTG ACC GAG CAG CAC CAG TTC CTC AGG GTG 
GCA CAG 

SEQ ID No 7 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GGG ATC CCG TGG GCT CCA CTG AGC TCT TGC CCG TCC CAA GCT TTA 
CAA CTG GCA GGC TGC TTG 



45 



50 



55 
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SEQ ID No 8 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

CTG GCT CAA GCA GCC TGC CAG TTG TAA AGC TTG GGA CGG GCA AGA 45 
GCT CAG TGG AGC CCA 60 



SEQ ID No 9 
15 SEQUENCE LENGTH: 63 bases 

SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

20 

AGC CAG CTG CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG 45 
GCT CTA GAA GGC ATC TCT 63 

25 SEQ ID No 10 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 

30 

TOPOLOGY: Linear 

TTC AGG AGA GAT GCC TTC TAG AGC CTG CAG CAG ACC CTG GTA CAG 45 
GAA CAG ACC GGA GTG CAG 63 

35 

SEQ ID No 11 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 

40 

STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT GCC 45 
GAC TTC GCT ACT ACC 60 



50 
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SEQ ID No 12 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



63 bases 
Nucleotide 
Single 
Linear 

TTG CCA TAT GGT ACT AGC GAA GTC GGC AAC GTC CAG CTG CAG TGT 
GTC CAG GGT GGG CCC CAA 



SEQ ID No 13 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG GCT CCG GCA CTG CAG 
CCG ACT CAG GGT GCG ATG 

SEQ ID No 14 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TGC TGG CAT CGC ACC CTG ACT CGG CTG CAG TGC CGG AGC CAT ACC 
CAG TTC CTC CAT CTG 

SEQ ID No 15 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCA GCA TTC GCC TCT GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG 
GTT GCC TCC CAT CTT 



45 
63 



45 
63 



45 
60 



45 
60 
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SEQ ID No 16 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : S i ngle 
TOPOLOGY: Linear 

GCT CTG AAG ATG GGA GGC AAC CAG AAC ACC GCC TGC GCG CCG CTG 45 
GAA AGC AGA GGC GAA 60 

SEQ ID No 17 

SEQUENCE LENGTH: 55 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

CAG AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC 45 
CAG CCG TTAG 55 

SEQ ID No 18 
SEQUENCE LENGTH: 53 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
30 TOPOLOGY: Linear 

TCGACTTA CGG CTG GGC CAG GTG ACG CAG AAC GCG GTA AGA CAC CTC 47 



GAG GAA 



53 



SEQ ID No 19 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TACAACTGGCAGGCTGCTTGA 2 1 
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SEQ ID No 20 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GACGTTGCCGACTTCGCTACT 



SEQ ID No 23 
SEQUENCE LENGTH: 26 bases 
SEQUENCE TYPE: Nucleotide 
35 STRANDEDNESS: Single 
TOPOLOGY: Linear 
GCACCATCGCCTTGAATTTTACGTAG 



21 



SEQ ID No 21 

SEQUENCE LENGTH: 21 bases 

15 SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

TGCCGGAGCCATACCCAGTTC 
20 SEQ ID No 22 21 

SEQUENCE LENGTH: 21 bases 

SEQUENCE TYPE: Nucleotide 
25 STRANDEDNESS: Single 

TOPOLOGY: Linear 

GCCTGCCAGTTGTAAAGCTTG 



21 



26 



SEQ ID No 24 

SEQUENCE LENGTH: 62 bases 
SEQUENCE TYPE: Nucleotide 
45 STRANDEDNESS: Single 
TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 47 
so CTG CTG AAG TCT CTC 

62 



55 
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SEQ ID No 25 

SEQUENCE LENGTH: 64 bases 
SEQUENCE TYPE: Nucleotide 
5 STRANDEDNESS: Single 
TOPOLOGY: Linear 

CTG TTC GAG AGA CTT CAG CAG GAA AGA CTG CGG CAG AGA GCT TGC 45 
10 TGG ACC CAG TGG AGT A CTG 64 

SEQ ID No 26 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 
20 GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT CTG CAG GAA 45 
AAG CTG TGC GCA ACC 60 



SEQ ID No 27 
SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
30 TOPOLOGY: Linear 

TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC GCT GCC 45 
TTG AAT TTT ACG TAC 60 



SEQ ID No 28 

SEQUENCE LENGTH: 29 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

CTT CAG CAG GAA AGA ACG CGG CAG AGA GC 29 
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SEQ ID No 29 

SEQUENCE LENGTH: 33 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

GC TTG GGA AGA GCA AGA GCT CAG AGA AGC CCA C 
SEQ ID No 30 

SEQUENCE LENGTH: 168 + 166 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Double 
TOPOLOGY: Linear 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50 
GACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 46 

AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 90 
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 86 

30 AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GCAAGCTTAG 140 
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CGTTCGAATC 136 



20 



25 



35 



32 



CCCGCCTAAT GAGCGGGCTT TTTTTTAT 168 
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 166 



SEQ ID No 31 
40 SEQUENCE LENGTH: 534 bases 

SEQUENCE TYPE: Nucleotide with corresponding protein 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

45 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG 50 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu 
50 1 5 10 
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CTG AAG TCT CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT 
Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Ser Gly Ala Ala 
15 20 25 30 



98 



10 



CTG CAG GAA AAG CTG TGC GCA ACC TAC AAA CTG TGC CAC CCT GAG GAA 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu 
35 40 45 



146 



15 



CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG TGG GCT CCA CTG AGC 
Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser 
50 55 60 



194 



20 



TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG CTG 
Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 
65 70 75 



242 



25 



30 



35 



40 



CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC 290 

His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly 
80 85 90 

ATC TCT CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT 338 

He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val 
9 ^ 100 105 no 

GCC GAC TTC GCT ACT ACC ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG 386 

Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
115 120 125 



45 



GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG CCA GCA TTC GCC TCT 
Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser 
130 135 140 



434 



50 



GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT CAG 
Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 
1*5 145 155 



482 
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»CC TTC CTC GAG GTG ICT TAC CCC GTT CTG CGT CAC CTG GCC CAG CCG 
S.r Ph. U,„ Glu Val s.r Tyr Ar g Val Leu Ar g Bls ^ 41a G1 „ Pr0 



530 



160 
TAA G 



165 170 m 



534 



SEQ ID No 32 

SEQUENCE LENGTH: 81 bases 

SEQUENCE TYPE: Nucleotide 

15 STRANDEDNESS: Single 

TOPOLOGY: Linear 

2Q GAATTCAACA AAACGGTTGA CAACATGAAG TAAACACGGT ACGATGTACC 50 
ACAAGTTCAC GTAAAAAGGG TATCGACAATG 

8 1 

SEQ ID No 33 
25 SEQUENCE LENGTH : 67 + 67 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Double 
TOPOLOGY: Linear 



45 



TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 
GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 



60 
56 



CATGCGA 67 
GTACGCTTCGA 67 

SEQ ID No 34 

SEQUENCE LENGTH: 118 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

„ AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50 
AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTAAT AATGTTCCCA 100 
TTGGAGGATG ATTAAATG 

118 

55 
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SEQ ID No 35 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



EP 0 502 637 A2 



35 + 35 bases 
Nucleotide 
Double 
Linear 



10 



AGCTCCATAT GGTACCAGAT CTCTCGAGAG TACTT 

GGTATA CCATGGTCTA GAGAGCTCTC ATGAAGATC 



35 
35 



15 



20 



SEQ ID No 36 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



23 + 15 bases 
Nucleotide 
Double 
Linear 



25 AGCTCAGCTG CAGCATATGG TAC 
GTCGAC GTCGTATAC 



23 
15 



30 

SEQ ID No 37 

SEQUENCE LENGTH: 12+12 bases 

SEQUENCE TYPE: Nucleotide 

35 STRANDEDNESS: Double 

TOPOLOGY: Linear 



40 



TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60 
GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56 



CATGCGGATC CC 72 
45 GTACGCCTAG GGGAAC 72 



so 
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SEQ ID No 38 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



84 bases 
Nucleotide 
Single 
Linear 



AAT TCA ACA AAA CGG TTG ACA ACA TGA AGT AAA CAC GGT ACG ATG 
TAC CAC AAG TTC ACG TAA AAA GGG TAT CGA CAA TGG TAG 



45 
84 



15 



20 



SEQ ID No 39 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



76 bases 
Nucleotide 
Single 
Linear 



25 CAT TGT CGA TAC CCT TTT TAC GTG AAC TTG TGG TAC ATC GTA CCG 
TGT TTA CTT CAT GTT GTC AAC CGT TTT GTT G 



45 
76 



30 

SEQ ID No 40 

SEQUENCE LENGTH: 24 + 24 bases 

SEQUENCE TYPE: Nucleotide 

35 STRANDEDNESS: Double 

TOPOLOGY: Linear 



40 



AATTCGCATG CGG ATC CATC GATC 24 
GCGTAC GCCTAGGTAG CTAGAGCC 24 



SEQ ID No 41 
45 SEQUENCE LENGTH: 174/177 Amino acids 
SEQUENCE TYPE: Amino acid 
TOPOLOGY: Linear 

50 
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Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin 

15 10 

Ser Phe Leu Leu Lys Cys Leu Glu Gin Val Arg 

15 20 



10 



Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 

25 30 



15 



Lys Leu (Val Ser Glu) m Cys Ala Thr Tyr Lys Leu 



35 



40 



20 



Cys His Pro Glu Glu Leu Val Leu Leu Gly His 
45 50 



25 



Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser 
55 60 



30 



Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys 
65 70 



35 



40 



45 



Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
75 80 85 

Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 

90 95 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin 
100 105 

Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp 
110 115 



50 



Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
120 125 
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Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe 



130 



135 



140 



Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val 



145 150 



Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 



155 160 



Val Ser Tyr Arg Val Leu 



Arg His Leu Ala Gin 



" 5 170 



Pro 

(vhere m is 0 or 1). 



SEQ ID NO 42 
T7A3 PROMOTER SEQUENCE 
SEQUENCE LENGTH: 46 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: single 
TOPOLOGY: linear 



AACAAAACGG TTGACAACAT GAAGTAAACA CGGTACGATG TACCAC 



SEQ ID NO 43 

lacO SEQUENCE 

SEQUENCE LENGTH: 22 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: single 

TOPOLOGY: linear 

AATTGTGAGC GGATAACAAT TT 



46 
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SEQ ID NO 44 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

GATAACAACA TATTCCCCAA A 21 



SEQ ID NO 45 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

GATAACAACA TGGTACCCAA A 21 



SEQ ID NO 46 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

AACAACATGG TACCCAAACA A 21 



SEQ ID NO 47 

SEQUENCE LENGTH: 86 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

AAAAAGGGTA TCGACATGGT ACCCGGGGAT CCACCTCAGG GTGGTCTTTC 
ACATTAGAGG ATAACAACAT GGTACCCAAA CAATAC 86 



47 
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Claims 

1. A vector which comprises an inducible selection gene, and a sequence which codes for a heterologous 
polypeptide. 

2. A vector as claimed in claim 1 wherein the selection gene comprises the tetA and tetR genes. 

3. A vector as claimed in claims 1 or 2 which includes a DNA sequence which is capable of conferring stability 
on the vector. 

4. A vector as claimed in claim 3 wherein the DNA sequence capable of conferring stability on the vector 
comprises the cer sequence. 

5. A vector as claimed in any one of the preceding claims which includes a transcription terminator as found 
at the terminus of gene 32 of bacteriophage T4. 

6. A vector as claimed in any one of the preceding claims which includes a promoter selected from the trp 
promoter and the T7A3 promoter. 

7. A vector which comprises a replicable plasmidic expression vehicle comprising a promoter, the cer sequ- 
20 ence, a transcription terminator as found at the terminus of gene 32 of bacteriophage T4, an origin of rep- 
lication and a DNA sequence which codes for a heterologous polypeptide. 

8. A vector as claimed in any one of the preceding claims wherein the heterologous polypeptide is selected 
from ricin A and G-CSF or an analogue thereof. 

25 

9. A host capable of expressing a heterologous polypeptide, which host comprises a vector as claimed in 
any one of the preceding claims. 

10. A process for preparing a host as defined in claim 9, said process comprising transforming a host by inser- 
3o tion therein of a vector as defined in any one of claims 1 to 8. 

11. A process for preparing a polypeptide, said process comprising culturing a host as defined in claim 9 so 
that polypeptide is expressed. 

12. A replicable plasmidic expression vehicle which comprises an inducible selection gene comprising the tetA 
35 and tetR genes, and a DNA sequence which codes for a heterologous polypeptide. 

13. A host which comprises a replicable plasmidic expression vehicle as defined in claim 12. 

40 
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TRANSCRIPTION TERMINATION SEQUENCE 

Fig. 1(a) 



Sail 

5 ' TCGACATTATATTACTAATTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



S£hl Hindlll 
AAAGCATGCA 3 ' 

TTTCGTACGTTCGA 5 ' 



Fig. Kb) 

Sail 

5 ' TCGACATTATATTACTAATTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



S£hl BamHI Styl 
AAAGCATGCGGATCCC 3' 
TTTCGTACGCCTAGGGGAAC 5 ' 
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Fig. 5 (a J 

EcoRl 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGAGAATTAA TCATCGAACT 
CACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 

Hpal 

AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 

Kpnl BamHI Xbal Sail Pstl SphT 
AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GCA-GCTTAG 
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CCTTCcIItc 

Clal 

CCCGCCTAAT GAGCGGGCTT TTTTTTAT 
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 



Fig. 5(b) 



EcoRI 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 
Hpal 

AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTAAT AATGTTCCCA 
TTGGAGGATG ATTAATG 



54 
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Fig. 7. 

^411/EcoRI 



PvuII 
Ndel 
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Fig. 8. 



2461/EcoRI 
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Fig. 18. 

PARTIAL SEQUENCE OF PICI1102 
Trp Promoter 

TTCTGGCAAATATTCTGAAATGAGCTGTTGACAATTAATCATCGAACTAGTTAACTAGTA 
1 + + + + + + 6Q 

S-D rbs Met Kpnl BamHI 

CGCAAGTTCACGTAAAAAGGGTATCGACAAT GGTACC CGG GGATCCA CCTCAGGGTGGTC 
61 + + + + + + 12Q 

Stop Kpnl Ricin A > 

TTTCACATTAGAGGATAACAACATGGTACCCAAACAATACCCAATTATAAACTTTACCAC 
121 + + + + + + lg0 

MVPKQYP IINFTT 



AGCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGGTCGTTTAAC 
+ + + + + + 

AGATVQ SYTNFIRAVRGRLT 



AACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAAACAGAGTTGGTTTGCCTAT 
241 + + + + + + 30Q 

TGADVRHEIPVLPNRVGLPI 

AAACCAACGGTTTATTTTAGTTGAACTCTCAAATCATGCAGAGCTTTCTGTTACATTAGC 
301 + + + + , + 360 

NQRFI LVELSNHAELSVTLA 



CCTGGATGTCACCAATGCATATGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTT 
+ + + + + + 

LDVTNAYVVGYRAGNSAYFF 



TCATCCTGACAATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAA 
421 + + + + + + 480 

HPDNQEDAEAITHLFTDVQN 

TCGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGCTGGTAATCT 
481 + + + + + + 540 

RYTFAFGGNYDRLEQLAGNL 



GAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGGCTATCTCAGCGCTTTATTA 
+ 4- + + + + 

RENI ELGNGPLEEAISALYY 



TTACAG TACTGGTGGCACTCAGCTTCCAACTCTGGCTCGTTCCTTTATAATTTGCATCCA 
601 + + + + + + 660 

YSTGGTQLPTLARSFIICIQ 
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Fig.W(cont) 



AATGATTTCAGAAGCAGCAAGATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAG 
661 + + + + + + 720 

MISEAARFQY1EGEMRTRIR 

GTACAACCGGAGATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAG 
721 + + + + + + 7ao 

YNRRSAPDPSVITLENSWGR 

ACTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAATTCAACTGCA 
781 + + + + + + 840 

LSTAIQESNQGAFASPIQLQ 

AAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTATATTAATCCCTATCATAGC 
841 + + + + + + 900 

RRNGSKFSVYDVSILIPIIA 



TCTCATGGTGTATAGATGCGCACCTCCACCATCGTCACAGTTTTGATTGCTTATAAGGCC 
901 + + + + + + 960 

LMVYRCAPPPSSQF* 



Kpnl Xbal PstI SphI Hindlll 
AGT GGTACC CGGGGATCC TCTAGAG TCGAC CTGCAGG CATGC AAGCTTA GCCCGCCTAAT 
961 + + + + + +1020 



Terminator 

GAGCGGGCTTTTTTTTATCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTC 
1021 + + + -f + + 1080 



CGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGCAA 
1081 + + + + + + 114 0 
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